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Introduction 



This diploma thesis, written under the supervision of Prof. Uwe Helmke at 
Wiirzburg University, discusses some aspects of time-optimal control theory of 
bilinear systems 



U G G, where G is a Lie group, X , X m are fixed elements of its Lie algebra 

0, and Vi : [0, T] — > K are the control functions. 

The resurgence of such systems in recent years has been caused by applica- 
tions in quantum computing and nuclear magnetic resonance spectroscopy, 
which are related to the question of manipulating effectively ensembles of 
coupled spin-particles. The dynamics of such spin systems are governed by 
a Schrodinger equation which takes the form of equation (*) with G = SU (2"). 

In the focus of this work is the problem of time-optimal control of such systems, 

1. e. the question of how to steer the system from a given initial state U (0) = U 
to a prescribed terminal state Uf in least possible time. This is not quite a 
classical optimal control problem, since the control variables can be chosen 
to be arbitrarily large, so that there will be a whole subgroup K of G, all of 
whose points being reachable from identity within arbitrarily small time t. 

Non-linear optimal control problems like this play a crucial role in control the- 
ory and a number of tools have been developed in order to solve them, most 
notably the maximum principle of Pontrjagin. On the other hand, under ad- 
ditional assumptions on the class of system (*), an explicit solution can be 
obtained using methods from the theory of Lie groups and Lie algebras, such 
as the Cartan decomposition and Riemannian symmetric spaces. This is one 
approach followed in current research on this subject, cf. [ 2 1, 1 16 1 and [ 17 1. 

The goal of this thesis is to take up the geometric ideas as formulated in the 
paper 1 161 and to present them in a setting which is both mathematically rig- 
orous and accessible without assuming too many prerequisities. 

The first chapter is aimed to give an overview on the relevant parts of Lie 
Theory and Geometric Control Theory which serve as the framework for the 
subsequent analysis of the time-optimal control problem associated with sys- 
tem (*). 
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INTRODUCTION 



In Chapter 2 we follow the geometric approach of 1 161 with its idea of replacing 
the original system on the Lie group G by a reduced system on a homogeneous 
space G/K. This idea is given a precise formulation in terms of the equivalence 
theorem of Section 2.3. Although this result lacks, in contrast to Pontrjagin's 
maximum principle, a recipe of how to compute time-optimal controls explic- 
itly, it nevertheless contributes towards a solution of the control problem. This 
is mainly due to the following facts. 

(1) The equivalence theorem allows for a reduction of the dimension of 
the state space of the control system. 

(2) The space of control paramaters of the reduced system is, in contrast 
to that of the unreduced system, compact. This guarantees the exis- 
tence of controls which meet the maximality condition of Pontrjagin's 
maximum principle. 

(3) The passage from (*) to the reduced system is the first step towards 
the complete solution of the time-optimal control problem in the spe- 
cial case where the Lie algebra g enjoys additional geometric proper- 
ties, see Section 2.5. 

As indicated in (2), the problem of time-optimal control of system (*) becomes, 
after replacing it suitably by a system with bounded controls, approachable 
via the maximum principle. Our main result in Section 2.4 is a computation 
of those trajectories of the reduced systems that are, under certain additional 
assumptions, extremal in the sense of the maximum principle. However, this 
yields only a large family of candidates for a time-optimal solution of the con- 
trol problem, and it is not evident of how to determine amongst those the ac- 
tually optimal trajectories. 

In Section 2.5 we are then specializing to the situation where the Lie algebra 
q has the additional property of being semisimple and belonging to a sym- 
metric Lie algebra pair (g,t) (cf. Definition 1.2.5). This makes the reduced 
time-optimal control problem on the homogeneous space G/K accessible to a 
geometric solution. The reason for this is that now any point [Up] of the ho- 
mogeneous space G/K is contained in the projection [A] C G/K of a suitable 
abelian subgroup A of G, and one always can steer the reduced system be- 
tween any two points of [A] along a sequence of geodesies of [A] . An application 
of Kostant's convexity theorem then shows that such a choice of controls is in- 
deed time-optimal. The original problem on the group G is thus reduced to a 
control problem on a so-called flat submanifold [A] of G/K. The control prob- 
lem reduced this way involves only commuting vector fields, which makes it 
possible to solve it explicitly. 

Optimal controls for the original control system (*) may in a subsequent step 
be obtained from those for the reduced system on G/K by again utilising the 
equivalence theorem as derived in Section 2.3. 
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The final chapter is devoted to a discussion of low-dimensional examples of 
spin-systems such as one- and two-particle systems. These are well suited 
for explicit computations, but are at the same time general enough objects to 
illustrate the theory developed in the second chapter. 

I am very grateful to my supervisor Professor Uwe Helmke for constantly sup- 
porting me in writing this thesis. Also, I would like to thank Martin Kleinsteu- 
ber for a number of helpful comments on this subject. Finally, I am indebted 
to Dr. Gunther Dirr for all his commitment in reading and discussing various 
aspects of this diploma thesis. 



CHAPTER 1 



Basic Notions from Lie Theory and Geometric 

Control 

1.1. Lie Groups and Lie Algebras 

The problem of steering a quantum mechanical spin system we are interested 
in can be formulated as a control problem on a Lie group, or a homogeneous 
space. Its solution involves (amongst others) methods from the theory of Lie 
groups, Lie algebras, and homogeneous spaces. In this section I shall state only 
those definitions and theorems that will be used later. I have nevertheless 
tried to make this exposition as self-contained as possible. The results that 
will be mentioned are all standard. They can be found in the book 1 6 1 and will 
therefore be stated without proof. 

DEFINITION 1.1.1. A Lie group (G, •) is a smooth manifold G endowed with the 
operations of group multiplication • and group inversion such that the map 

(1.1.1) G X G > G, ( gi ,g 2 )^ gi .g- 1 

is smooth. 

EXAMPLE 1.1.2. The general linear group G = Gl n R of invertible, real (n x n)- 
matrices, and closed subgroups of this such as Sl n R and SO n R. 
The group G = SU n C Gl n C of unitary (n x n) -matrices of determinant 1 will 
be the most important example to us. 

DEFINITION 1.1.3. A Lie algebra is a (real or complex) vector space g together 
with a skew symmetric bilinear operation [•, •] such that Jacobi's identity holds: 

(1.1.2) [X, [Y, Z]} + [Y, [Z, X}} + [Z, [X, Y]] = for all X,Y,Ze g. 

The importance of Lie theory in many fields of mathematics and physics arises 
from the fact that there is a natural linearization of both the manifold struc- 
ture (i.e. tangent spaces) and the conjugation map (<?, h) ^ hghr 1 (giving the 
tangent spaces the structure of Lie algebra) that allows to study nonlinear 
problems on the group by translating them into linear problems on the Lie 
algebra level. These two structures are closely related by the exponential map. 

In order to associate a Lie algebra structure to a Lie group we make the 
following definition. 
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DEFINITION 1.1.4. A vecor field x on G is called right-invariant if it satisfies 

(1.1.3) x(9) = DiR g (X (1)) for all g e G 
Here R g : h<-^ hg denotes right-translation by g. 

LEMMA 1.1.5. Any right-invariant vector field x S T(TG) is smooth and com- 
plete. The set of right-invariant vector fields is closed under the Lie bracket [•, •] 
on T(TG). Any X e T±G can be extended uniquely to a right-invariant vector 
field X with X(l) = X. In particular, the space of right-invariant vector fields 
has dimension equal to dim G. 

With these preparations in mind we are in position to endow the tangent space 
at identity 1 of the Lie group G with a Lie algebra structure. 

PROPOSITION 1.1.6. Set g := T ± G. Then Qisa Lie algebra with bracket 

(1.1.4) [X,Y] := - \x,Y] (1). 

In view of Lemma 1.1.5 the following definition makes sense. 

DEFINITION 1.1.7. Let X e q and j x be the integral curve of X starting at 
7(0) = 1. Then define the exponential map 

(1.1.5) exp : g — > G 

by 

(1.1.6) exp(X) := 7x (l). 

LEMMA 1.1.8. The so defined map exp is smooth, its differential at X = being 
D exp = id fl . In particular, exp is a diffeomorphism near X = 0. 

Example 1.1.9. G = Gl n R has Lie algebra g = K" x ™ and the exponential map 
is given by cxp(X) = J2T=o T> xk - 

There is a natural smooth Lie group action of G on itself given by 

(1.1.7) a:GxG^G, (g,h)^ ghg- 1 

with the identity as a fixed point. Thus differentiating a at the identity with 
respect to the second variable yields the smooth group homomorphism 

(1.1.8) Ad:G^G/( ), g^Ad g 
with 



(1.1.9) Ad g (X) = j t a(g,exptX) 



We call this homomorphism adjoint representation of G on q. Differentiating 
Ad at the identity g = 1 yields the linear map 

(1.1.10) ad:g — ► End(X), X\ — ► ad(X) 
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with 



(1.1.11) &dX(Y) = Ad cxptx (Y) 

at t=o 

This map ad is a homomorphism of Lie algebras, i.e. 

(1.1.12) ad [X, Y] = adX o adF - adF o adX 

holds for all X, Y e g. For this reason the map ad is called adjoint Lie algebra 
representation. 

Theorem 1.1.10. For all x,Y eg, 

(1.1.13) adX(Y) = [X,Y]. 

For later use we make the following important definition. 

DEFINITION 1.1.11. On the Lie algebra g define the Killing form n to be the 
bilinear form 

(1.1.14) (X, Y) i — > (X, Y) = tr (adX o adF) . 

It is not difficult to see that the Killing form is symmetric and ad-invariant, i.e. 

(1.1.15) k{[X,Y],Z) = k(X,[Y,Z]) 
holds for all X,Y,Zeg. 

EXAMPLE 1.1.12. The Killing form ong = su(n) is given by 

(1.1.16) k(X, Y) = 2ntr(XY). 

1.2. Homogeneous Spaces, Riemannian Symmetric Spaces, and 

Maximal Tori 

Throughout this section G denotes a Lie group and K C G a closed subgroup 
(which is known also to be a Lie group in the induced topology). We first of all 
collect some facts concerning the space of left cosets 

(1.2.1) G/K:={(gK\g E G} , 
which we also call a G-homogeneous space. 

PROPOSITION 1.2.1. G/K equipped with the quotient topology is a Hausdorff 
topological space. It inherits from G the structure of smooth manifold. This 
manifold structure can be characterized as the unique differentiable structure 
such that the canonical projection 

(1.2.2) 7r : G — > G/K 

is a smooth submersion. In the language of fibre bundles, the triple (it, G, G/K) 
is a principal K -fibre bundle over G/K. 
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Proof. 1271, p. 32-33. □ 

There is a natural action of the Lie group G on the homogeneous space G/K: 

(1.2.3) ip : G x G/K — > G/K, (g, hK) i — ► ghK. 

This action is smooth and transitive on G/K. There is an important converse 
to this observation: 

THEOREM 1.2.2. (Theorem on transitive Lie group actions). Let a : GxM — > M 
be a smooth and transitive action ofG on the manifold M, and p e M arbitrary. 
Then a is equivalent to the natural action tp of G on G/Stab p cr in the sense that 
there is a diffeomorphism ip : M — » G/Stab p cr which satisfies 

(1.2.4) ip(a(g,x)) = ip(g,ip(x)) 
for all g e G and x e M. 

Proof. [27|, p. 33. □ 

We next give a criterion for the existence of a G- (left-) invariant Riemann- 
ian metric on the homogeneous space G/K. Here a metric (•, •) is called G- 
invariant if for all x e G/K, h e G and X, Y e T x (G/K) the following holds: 

(1.2.5) {X, Y) x - (DMK -)(X), D x p(h, •)(¥)) vM , 

i.e. any diffeomorphism ip(h, ■) of G/K is an isometry of (G/K, (-, •}). 

THEOREM 1.2.3. Le^ G anc? K as before, denote by g and t its Lie algebras, and 
let p := g/f = {[X] |X G g} ^/ie quotient of the vector spaces g and 6. TTien there 
exists a G-invariant metric on the homogeneous space G/K if and only if the 
closure of the set 

(1.2.6) {Ad fe :p^p,[X] » [Ad k X}\keK} 
is compact in End(p). 

Proof. 0, p. 67. □ 

Example 1.2.4. Theorem 1.2.3 can be applied in the situation of a compact 
Lie group G. Consider the action a of G x G on G which is defined by 

v((9i,92),g) ■= 91992 1 - 
G x G acts transitively on G with stabilizer 

K := Stab CT (l) = {(<?,<?) | g EG}^G. 

Thus G is (G x G)-equivariantly (in the sense of equation (1.2.4)) diffeomorphic 
to the homogeneous space (G x G) /K. Theorem 1.2.3 applies to this space 
because K is compact and thus has compact image under the continuous map 
Ad : K — > End(p), k h-> Ad k . Here p denotes the quotient of g x g with 
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t = { (X,X)\ X e g}. So the manifold G can be endowed with a metric (-, •) 
which is invariant under the action a. In particular, the subgroups G x 1 and 
1 x G act on G by isometries. This means that the metric (-, •) is invariant under 
both left and right translations, and for this reason will be called bi-invariant. 
Another invariance property of the metric (-, •) is the following. Since K is the 
stabilizer subgroup of the identity element, we see that for each k e K the map 
Di<j{k), which we define by 

D 1( j{k)(X) :=j t <T(k,e X ptX)\ t=0 

for all X G q, is an endomorphism of g. By the definition of an invariant metric 
it follows that 

(D 1 a(k)(X),D 1 a(k)(Y)) 1 = (X,Y) 1 
holds for all I,Y eg and k e K. Now 

D 1 a{k){X) = j t a(k,exptX)\ t=0 = Ad k X, 

and therefore 

(Ad k X,Ad k Y) 1 = (X,Y) 1 
for all X, Y e g and k e K. Furthermore, let Z e 6 arbitrary Then for all isl, 

(Ad exptz X, Ad exptz Y) 1 = (X,Y) 1 . 

Differentiating this equation with respect to t at t = and using Theorem 
1.1.10 yields 

(1.2.7) ([X,Z\,Y) 1 = {X,[Z,Y]) 1 , 

i.e. the so-called ad-invariance property of the metric (-,■). 

DEFINITION 1.2.5. Let g a Lie algebra (over R or C) and ECga subalgebra. 
Then (g, t) is called a symmetric Lie algebra pair, if there exists a Lie algebra 
automorphism 8 : g — > g, which is involutive, i.e. 9 2 = id, and which has t as its 
1-eigenspace. Such an automorphism 9 is called Cartan involution. 

LEMMA 1.2.6. Let (g, 6) and 9 as in the previous definition and denote by p the 
— 1-eigenspace of 9. Then g = I © p. TTus direct sum decomposition will in the 
following be named Cartan-like decomposition. It has the following properties: 

(i) (commutator relations) 

[M]ce, 
[e,p]cp, 
[p,p]ce. 

fuj Wit/i respect to the Killing form k on g, we Ziaye ^/ia^ p C t 1 and t C p- 1 . // 
moreover q is a semisimple Lie algebra (cf. Definition 1.3.3), then p = t 1 - a/zd 
£ = p- 1 , i.e. g is the orthogonal sum oft and p. 
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Proof, (i) Let X and Y eigenvectors of 9 with eigenvalues \,fi e {±1}- 6 
is a Lie algebra homomorphism, so that 

9 [X, Y] = [9(X), 9{Y)] = [XX, pY] = \[i [X, Y] , 

which implies the result. 

(ii) Let X e 6, Y e p. Then, with respect to a basis of g adapted to the direct 
sum decomposition g = t © p, the endomorphisms adX and adY are (by (i)) 
represented by matrices 

adX = | ° | and adF = 
I * 




So 

adX o adY = 








which has trace equal to 0, hence n(X, Y) = 0. This shows pet 1 and t C p^. 
Now let g be semisimple. By Definition 1.3.3, this means that the Killing form 
is non-degenerate on g. So for any subspace V of g it follows that 

dim V + dim V 1 ' = dim g. 

In particular, dim J- 1 = dim p. Now p C E 1 , and therefore p = t^. The same 
conclusion applies to give 6 = p^. □ 



Symmetric Lie algebra pairs are closely related to a special class of homoge- 
neous spaces G/K, called Riemannian symmetric spaces. Those can be charac- 
terized as Riemannian manifolds which are endowed with a G-invariant metric 
(•, •} such that there exists an isometry $ which fixes the point K e G/K and 
reverses all the geodesies through K. Any such Riemannian symmetric space 
{G/K, (•, •)) gives rise to a symmetric Lie algebra pair (g, t). 
There is a natural group homomorphism from G to the Lie group of isometries 
Isom (G/K) of (G/K, (-,-)), which is given by 

(1.2.8) ^:g^^(g,-), 

with ip(g, •) as in equation (1.2.3). In the case where the Lie group G is con- 
nected and semisimple, one can show that ip is indeed an isomorphism between 
G and the connected component of identity of Isom (G/K), cf. |21 1, p. 143. The 
Cartan involution 9 can then be viewed as an "infinitesimal isometry" which is 
related to $ as follows: 

(1.2.9) 9(X) = — ^- 1 ($o^(exptX)o$)| t=0 , 

cf. jl3l . p. 227-228. Thus the expression t h-> tp^ 1 ($ o <p (exptX) o $) can be 
regarded as an 1-parameter subgroup of G. Equation (1.2.9) then states that 
this 1-parameter subgroup is generated by 9(X). 

Symmetric Lie algebra pairs will be of importance in the sequel, since the 
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control problem we are going to discuss in Section 2.2 can be reformulated as 
a problem on a homogeneous space G/K (see Section 2.3) and then be solved, 
provided this space G/K happens to be symmetric (see Section 2.5). 



EXAMPLE 1.2.7. (cf. [10 1, Ch. X). For the following pairs (G, K) of Lie groups 
the homogeneous spaces G/K are all symmetric. Pairs in the left columns lead 
to noncompact examples, those in the middle columns to compact ones. 



Type 


G 


K 


G 


K 


dim 


AI 


SlnM. 


SO(n) 


SU(n) 


SO(n) 


i(n-l)(n + 2) 


All 


SU*(2n) 


Sp(n) 


SU(2n) 


Sp(n) 


(ra-l)(2n + l) 


A III 


SU(p,q) 


S(U(p) X U(q)) 


SU( P + q) 


S(U(p) X U(q)) 


2pq 


BDI 


SO (p, q) 


SO(p) X SO{q) 


SO(p + q) 


SO(p) X SO(q) 


pq 


Dili 


SO*(2n) 


U(n) 


SO(2n) 


U(n) 


n(n — 1) 


CI 


S Pn R 


U(n) 


Sp(n) 


U(n) 


n(n + 1) 


CII 


Sp(p, q) 


Sp(p) X Sp(q) 


Sp(p + q) 


Sp(p) X Sp(q) 


Apq 



These examples give rise to the following table of symmetric Lie algebra pairs 
(fl, 6). The action of the Cartan involution on X e g is described under 9, while 
the rightmost column shows the rank of the algebra q, which will be defined 
in Theorem 1.2.8. 



Type 


9 


t 





t 


9 


rk 


AI 




so(n) 


su(n) 


so(n) 


-X T 


71 — 1 


All 


su*(2n) 


sp{n) 


su(2n) 


sp{n) 


J Y T T- 1 


71 — 1 


A III 


SU(p, q) 


s(u(p) x u(q)) 


su(p + q) 


s(u(p) x u(q)) 




min(p, q) 


BDI 


so(p,q) 


S0(p) x So(q) 


S0(p + q) 


so(p) x 5o(q) 




min(p, q) 


Dili 


so*(2n) 


u(n) 


so(2n) 


u(n) 


J n X J n 


[\ n ] 


CI 




u(n) 


sp{n) 


u(n) 


J n X J n 


n 


CII 


Sp(p, q) 


sp(p) x sp(q) 


Sp(p + q) 


Sp(p) x 5p(q) 




min(p, q) 



All those examples lead to irreducible symmetric spaces G/K. By this we 
mean that 

• the Lie algebra g is semisimple (cf. Definition 1.3.3) and peg contains 
no ideal of g other than the zero ideal, and 

• the Lie algebra representation 

t — ►End(p), X\ — >&AX 

is irreducible. 

In fact, the above list by E. Cartan exhausts all irreducible Riemannian sym- 
metric spaces (up to 12 compact and noncompact exceptional cases). 
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The notion of a maximal torus of a compact, connected Lie group G will be of 
considerable interest in the sequel. By this we mean a Lie subgroup T C G 
which is 

(1) abelian, 

(2) compact, 

(3) maximal with respect to these properties, i.e. any subgroup T" that 
satisfies (1) and (2) and contains T already equals T. 

It is easy to see that any maximal torus is isomorphic to a product of copies of 
S 1 , hence the name. 

A maximal abelian subalgebra t of a Lie algebra q is defined to be a subalgebra 
satisfying 

(1) [t,t] = o, 

(2) any subalgebra t' that contains t and satisfies (1) is equal to t. 

The salient facts concerning maximal tori of a compact Lie group, respectively 
maximal abelian Lie subalgebras, are collected in the following theorem. 

THEOREM 1.2.8. (Torus theorem). Let G be a compact, connected Lie group 
with Lie algebra q. Then the following holds. 

(i) The equation T — exp t defines a bijective correspondence between the maxi- 
mal abelian subalgebras t of q and the maximal tori T of G. Every connected 
abelian subgroup of G is contained in a maximal torus in G, and every abelian 
subalgebra of q is contained in a maximal abelian subalgebra ofQ. 

(ii) All maximal tori in G are conjugate to each other, and Ado acts transitively 
on the set of maximal abelian subalgebras of q. Each element of G is conjugate 
to an element of a given maximal torus, and Adc(t) = Q for any maximal abel- 
ian subalgebra t of q. In particular, any two maximal abelian subalgebras 
have the same dimension. This dimension is called the rank of the Lie algebra 
q (the rightmost column of the second table in Example 1.2.7). 

Proof. [6|, Theorem 3.7.1 (iii), (iv). □ 

1.3. Root Space Decomposition and Semisimple Lie Algebras 

1.3.1. Root space decomposition of a compact Lie algebra. Through- 
out this section let jj denote the Lie algebra of a compact Lie group G. For 
simplicity we will refer to the Lie algebra q of such a Lie group as a compact 
Lie algebra, although there is an intrinsic definition of compact Lie algebras in 
terms of the Killing form, which differs from ours. For such a Lie algebra there 
exists the well-known root space decomposition into a direct sum of subspaces 
which are simultaneously invariant under the endomorphisms ad A, where X 
is an element of a maximal abelian subalgebra t of q. The properties of such a 
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decomposition will be widely used in the proof of the time-optimal torus theo- 
rem, cf. Section 2.5. 

To make the theorem on the Jordan normal form applicable it is convenient to 
pass to the complexification g c = g ® R C of g which we give the structure of 
complex Lie algebra by defining its bracket as 

(1.3.1) [A + LB, A' + LB'] := [A, A'} - [B, B'\ + i ([A, B'] + [B, A'}) 

for A, A', B, B' e g. 

There also is a unique linear extension of any A € End(g) to a complex linear 

endomorphism of gc, which will again be denoted by A. 

The existence of a root space decomposition is based on the following lemma. 

LEMMA 1.3.1. For each leg, the endomorphism a,dX £ End(gc) is diagona- 
lizable, with only purely imaginary eigenvalues. 

The proof, for which we refer the reader to 1 6 1, uses in a crucial way the bound- 
edness of the set Ado C End(g c ) to conclude that the invariant subspaces 
Qj = Cjl + Nj in the Jordan decomposition of &dX have nilpotent part N 3 equal 
to and eigenvalues cj e iR. 

THEOREM 1.3.2. (Root space decomposition of gc). Let t be any abelian subal- 
gebra of q. Set a := S t*. Then there is a finite set S = {iai, ia m } of non- 
zero real-linear forms t — ► M and a decomposition 

(1-3.2) flc = 0o®0fl Qj 

j=i 

such that Q aj 7^ {0}, and 

(1.3.3) adX(F) = ia j {X)Y 

holds for all X e t, Y e g Qj , arcd j = 0, m. Moreover, if a 6 {0} U £, £/ierc. 
—a G {0} U £, and g_ a = g a . /rc particular, g = go, so ^/ia^ g is of the form 
a + ia with a = go n g. AZso, t C a. 
77ie following commutator relations hold: 

(1.3.4) [ 8Q4 , floj ]c{ + €{0}UE, 

[ {0} , otherwise. 

Iftis maximal abelian in g, then f) := t + it = go and t = flo H 0. /re £Aa£ case 
£/iere are ^/ie decompositions 

m 

(1-3.5) 0c = F)©00^, 

3=1 

and 

(1-3.6) = t© (0a 3 ©0-a 3 ) ng, 

ayes+ 
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where S + is any subset ofY, which satisfies 

(1.3.7) a s eS + ^ - aj i S+ 

for all j = 1, m. The subspaces g aj are then called root spaces, the linear 
forms iaj are named roots. 

Proof. [6 1, p. 145-146. □ 



For any root ia £ E, the map a : t — > M defines a non-zero, real-linear form. 
Its kernel kera therefore defines a hyperplane in t, which is called the root 
hyperplane for a. The connected components of the set 

(1.3.8) t\ ( (J kera) 

are called Weyl chambers; they are open, convex polyhedral cones in t. 
We next define the Weyl group W to be the quotient 

(1.3.9) W = iV(t)/Stab(t), 
with 

(1.3.10) N(l) = {g £ G\ Ad 3 t = t} 
the the normalizer of t in G, and 

(1.3.11) Stab(t) = {ge G\ Ad g X = X VX e t} 

the pointwise stabilizer of t in G. The group turns out to be finite. Fur- 
thermore, as a consequence of the Torus Theorem 1.2.8, it can be shown that 
the isomorphism type of W does not depend on the choice of t. It is therefore 
justified to call W the Weyl group of the Lie algebra q. 
There is the following action of W = { [g]\g e N(t)} on t: 

(1.3.12) Wxt — >t, (\g],X)t — > gXg- 1 . 

The action of W is transitive on the set of Weyl chambers. This statement is 
part of the Weyl covering theorem, cf. |6|, p. 153. 

At this point we conclude those general considerations on compact Lie algebras 
and turn to compact semisimple algebras, where a refined version of some of 
the statements made before can be given. 



1.3.2. Compact semisimple Lie algebras. 

Definition 1.3.3. A Lie algebra g (over C or over R) is called semisimple if 
the Killing form 

(1.3.13) k : g x g — > g, (X, Y) i — ► tr(adX o adF) 
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is nondegenerate on g. 

The Lie algebra g is called simple if it is not abelian and does not contain any 
ideals other than {0} and g. 

Remark 1.3.4. The relation between simple and semisimple Lie algebras is 
such that every semisimple Lie algebra splits uniquely (up to isomorphism) 
into an orthogonal (with respect to the Killing form) sum of simple Lie alge- 
bras, cf. |12|, p. 23. 

Example 1.3.5. See Example 1.2.7. The Lie algebras g which appear in the 
second table are all simple. 

For the remainder of this section let g be a semisimple, compact Lie algebra 
over the reals. Furthermore, let t be a maximal abelian subalgebra of g, and 
denote by EC t* the set of roots in the root space decomposition of gc with 
respect to t. Since the Killing form is nondegenerate on g we may identify t* 
with t in the following manner: 
(1.3.14) 

t* 3 A i — ► X x et such that X(Y) = n {X\, Y) is satisfied for all Y e q. 

We will name X a eta coroot, if ia € £ is a root. The set of coroots is denoted 
by £*. One observes that the coroots are perpendicular to their respective root 
hyperplanes. 

In addition to the results of Subsection 1.3.1, the following theorem holds. 

THEOREM 1.3.6. Let t C g be a maximal abelian subalgebra of the compact, 
semisimple Lie algebra q. Then: 

(i) There exists let such that t = ker(adX). Conversely, the kernel of&dX, 

X e g arbitrary, is a maximal abelian subalgebra ofg if and only if its dimen- 
sion is the smallest one possible. Such an X e g is also called a regular ele- 
ment. In that case the dimension of ker (adJf) coincides with the rank of the 
algebra g as defined in 1.2.8. 

(ii) The set S* of coroots spans t as a vector space. 

(Hi) The root spaces in a root space decomposition of gc with respect to t are all 
l-dimensional. 

Proof. Cf. QH, p. 80 for a proof of (i), and p. 39 for a proof of (ii) and 

(iii) . □ 

Example 1.3.7. Consider the Lie algebra g = su(n). 

The complexification of g is the simple Lie algebra gc = s[„C. Choose 

t := J idiag(0!, 6 n )\ 9j eR,J2 9 i = \ 
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to serve as a maximal abelian subalgebra of g, and let fj := t + it be its 
complexification. For i ^ j set 

{1 , if T — — % j S — — J j 
0, otherwise. 

Then the root space decomposition of gc with respect to t is given by 

n: n : :/•:, ,. 

The corresponding roots are 

ctij : t — ► iM, idiag(6»i, ...,8 n ) i — > i(0* - Oj). 
Indeed, a calculation yields 

/ Wi \ / \ 

v y V o y 

The root spaces = CE t j obey the commutator relations 

Qu, if j = k,i^ Z, 

[Qij,9ki] = \ Qkj, if « = i, j 7^ 
0, else. 

>From the identification of \]* with f) via the Killing form one obtains the co- 
roots X ai] = idiag(0, 1, — 1, 0) £ I) (here 1 is the entry at the i-th, -1 at 
the j-th position). They satisfy the relations 



(« (X atJ , X ai] ) K {X akl , X au )) " 



o, if {i,j}n{M} = 0, 

— 1, if i = Z, j ^ k or i ^ I, j = k, 
= { 1, if i = fc, j 7^ Z or i ^ k, j = I, 
if i = l,j = k, 
if % = k,j = Z, 

7T 7T 27T 



so that the coroots include angles equal to 0, ^, or n. 

The Weyl group W acts on X e t by permuting the entries on the diagonal of 
X and therefore turns out to be isomorphic to the symmetric group S(n). 
On g one defines the involution 9 : X i — ► -X T . Its 1-eigenspace 6 is equal to 
the subalgebra so(n) of g, while its -1-eigenspace p comprises those matrices 
X of g which satisfy X = X T . Therefore, (g, t) is a symmetric Lie algebra pair 
in the sense of Definition 1.2.5. In fact, 9 arises as the Cartan involution asso- 
ciated with the Riemannian symmetric space SU(n)/SO(n), see Example 
1.2.7. Notice also that 

t = J2QnC(E ij +6E ij ), 

i<j 
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while 

is complementary to f) in p. This is not accidently and reflects a general 
correspondence between the Cartan-like decomposition q = £ © p and the root 
space decomposition of qc with respect to a maximal abelian subalgebra I) cp 
ofg, cf. QB, p. 336. 



We next give a modified version of the Torus Theorem 1.2.8 (ii) in the situation 
of a Lie group G with semisimple Lie algebra g, which will be of interest later 
on. 

LEMMA 1.3.8. Let G be a Lie group with semisimple Lie algebra q, and K a 
closed subgroup ofG such that its Lie algebra t together with q forms a symmet- 
ric Lie algebra pair. Furthermore, let q — 6 © p be the corresponding Cartan-like 
decomposition, and f) any maximal abelian subalgebra of p. Then 



(1.3.15) p = |J Ad fc f). 

PROOF. |10|, Chapter V, Lemma 6.3 (hi). □ 

To conclude this section we cite a theorem that relates the action of the Weyl 
group W on f) to the adjoint action of if on p. 

THEOREM 1.3.9. (Kostant's convexity theorem). In the setting of the previous 
Lemma 1.3.8, let T : p — > \) be the orthogonal projection with respect to the 
Killing form on q. Then for any I e I) 

(1.3.16) T(Ad K X) = c(W-X), 
where c denotes convex hull. 

Proof. J20|. □ 

EXAMPLE 1.3.10. Consider the symmetric Lie algebra pair (g, t) with g = su(2) 
and 

' iX 



t = 



-iX 

Its Cartan-like decomposition is g = 6 © p with 



lei c su(2). 



z 
-z 



z e C 



Fix X — I ^ Z ° ] e p, z / 0, and set f) = RX Q . It is easily seen that the 

\ -zo J 

Weyl orbit of Y = XX e I) is W ■ Y = {±Y} and the orbit of the adjoint action 
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of K on Y is 

AdxY = {kYk^l k G K) 




i.e. a circle Ccp centered at the origin and passing through Y. The ortho- 
gonal projection of C on 1) is the set {a^l — 1 < a < 1}, which in fact is the 
convex hull of {±^}- 



1.4. Some Definitions from Geometric Control Theory 

This section is aimed to introduce some basic definitions and results from geo- 
metric control theory which serves as the appropriate framework for the kind 
of problem to be considered later. Geometric control theory is primarily inter- 
ested in the investigation of controlled dynamical systems on a manifold M, 
their behaviour being governed by ODEs of the form 

(1.4.1) x = f(x(t),u(t)), x(0)=x , 

where the parameter u (the "control function") is allowed to vary with time t 
within a given parameter space U C K m (the "control set"). 
Given such a system (1.4.1) together with a control set U one naturally can ask 
the following questions. 

(1) Does there exists a control function t ^ u(t) that transfers the initial 
state x a of system (1.4.1) to a prescribed terminal state x F = x(tF)t 
Describe the set of all points in M that are reachable in this sense! 

(2) Proof the existence of time-optimal controls and give explicit construc- 
tion schemes for them. 

To make things precise we introduce some terminology. 

DEFINITION 1.4.1. A nonlinear control system S = (M, f u , U) is a triple con- 
sisting of a smooth manifold M, a parameter space U C M m and a family 

(1.4.2) {/„ e T(TM)\u e U} 

of vector fields on M. We will refer to M as the state space of the control system, 
to U as the space of control parameters, and to u e U as a control parameter. A 
control is a path t ^ u(t) in the space of control parameters. 
A curve x : [0, T] — > M is called an integral curve for the control u : [0, T] — ► 
U,ti-> u(t) if it is absolutely continuous, and if 

(1.4.3) i(t) = / u(t) (*(*)) 
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is satisfied for all < t < T. 

NOTATION 1.4.2. We frequently write f(x,u) rather than f u (x) for the value 
of the vector field /„ at the point x. 

To guarantee the existence of an integral curve as defined above, we make the 
following standing assumptions: 

• The map u i— > f(x, u) is Lipschitzian for any fixed x € M. 

• The vector field f u e T(TM) is smooth for all u eU. 

• The partial derivatives of the map (x,u) h-> f(x,u) in directions of M 
are locally bounded in any point (x Q , u ) e (M x U). 

• The control t *—>■ u(t) is measurable and locally bounded on its interval 
of definition. 

Under these assumptions, the existence and uniqueness of an integral curve 
t i > x(t) with prescribed initial condition x(t ) = x a is guaranteed by the 
Caratheodory theorem for any control 1 i— > cf. 1 1 1, p. 28-29. 

DEFINITION 1.4.3. Let E = (M, /„, 17) be a control system, x e M, and T > 0. 
We define i?(x , T) to be the set of all xf G M with the property that there 
exists a control u : [0, T] — > J7 which generates a trajectory 1 1— > a;(f) such that 
x(0) = x and x(T) = x F . We call i?(x , T) the set of reachable points from x 
at time t. 

Define furthermore the reachable set from x within time T to be 

(1.4.4) R(i ,T)= |J R(x 0) t), 

0<t<T 

and the reachable set for x to be 

(1.4.5) R(xo)= |J R(x ,T). 

0<T<c» 

The system E is called controllable, if R(.t ) = M holds. 

In the sequel we will pay attention to control problems on Lie groups G and 
homogeneous spaces G/H only. We will therefore be confronted with a special 
class of control systems. 

DEFINITION 1.4.4. A control system E = (G, /„, U), U C K m , on a Lie group G 
is called affine right-invariant if {f u } ueU is a family of vector fields on G of the 
form 

m 

(1.4.6) f(g,u) = X (g) + J2nX i (g) 

i=l 

with Xj, i = 0, to, right-invariant and u = (m, ■■■,u m ) e [/C R m . 

Now what about questions (1) and (2) formulated above in the context of Lie 
groups? There are very detailed investigations on those topics, see e.g. the 
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paper by V. Jurdjevic and H. J. Sussmann [14| where problem (1) is completely 
answered, the paper by D. Mittenhuber [ 23 1 for a treatise of question (2) as 
well as Jurdjevic's book |15|. The following is a survey of the results needed to 
tackle the question of controllability of those quantum mechanical systems we 
are finally interested in. 

THEOREM 1.4.5. (Controllability of affine right-invariant systems on Lie groups) 

Let S = (G, /„, U) with f u (g) = X (g) + YliLi u iXi{g) be an affine right invari- 
ant system on the Lie group G. Denote by g the Lie algebra of G. Then the 
reachable set R(l) is always a semi-group. IfK(l) happens to be a group then 
it coincides with S(X Q , ...,X m ), the Lie subgroup of G generated by the elements 
expX , ...,expX m e G 

Each of the following two conditions is sufficient for S(X , ...,X m ) to be a Lie 
subgroup: 

(i) Xo = (absence of a drift term). 

(ii) S(X , ...,X m ) is compact. 

If(ii) is satisfied, then there is a constant T > such that R(l) = R(l, T). 
Furthermore, under the additional assumption that G is connected, the 
following criterion on controllability holds: 

(1.4.7) S is controllable (X , X m ) Lic = g. 

Proof. |9|, Lemma 4.5 and Theorems 5.1, 6.5. □ 
A little bit more theory is needed to answer the remaining question (2). 



1.5. Optimal Control and the Maximum Principle 

In this section we take up the discussion of time-optimal control as formulated 
in question (2), Section 1.4. 

DEFINITION 1.5.1. Let S = (M,f u ,U) be a nonlinear control system and ip : 
M x U — > K a continuous function. For a trajectory t ^ (x(t),u(t)) of E with 
initial point x(t a ) = a and terminal point x(ti) = b we define the cost of transfer 
between a and b to be 

(1.5.1) [ 1 <p(x(t),u(t))dt. 

J t 

A trajectory t (x(t) : u(t)) of S that transfers a S M to b S M is called optimal 
if 1 <p(x(t), u(t)) dt is minimal amongst all costs of transfer between a and b. 
In the special case <p = 1 we refer to the corresponding cost functional as time 
and to the respective optimal trajectories as being time-optimal. 
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It is convenient to implement the cost function ip into the given control system 
E as follows. Set T, cxt ■= (R x M, f u ,U), where f u is the vector field on K x M 
given by 

(1.5.2) fu{x Q ,x) = (<p(x,u), f u (x)) . 
We call E C xt the cost-extended system for (E, 99). 

The geometric significance of the trajectories of E cxt is that optimal trajecto- 
ries 1 1— *• x(t) of £ for the transfer of a to 6 arise as the projections on M of those 
trajectories i i-> (xo(i), of E cxt that transfer (0, a) to (xoCT), &) with xo(T) 
minimal. Such trajectories t ^ (xo(t), x(t)) of the cost-extended system neces- 
sarily have their terminal point (x (T),b) on the boundary of R(0, a). We call 
this the extremality property (E) of optimal trajectories for (£, ip). 

We shall now discuss a necessary condition for a control function u to gene- 
rate a trajectory which enjoys the extremality property (E). This will lead us 
to the well-known maximum principle of Pontrjagin. To this aim we need to 
introduce some terminology from classical mechanics (cf. e.g. |22|). For 
simplicity we focus at first on the case M = R", and then extend the dis- 
cussion to arbitrary smooth manifolds M. 

1.5.1. The case M = R™. In this situation we regard the manifold N = 
R n+1 x R™ +1 as a state space on which a family H(x,p, u) of so-called Hamil- 
tonian functions, parametrized by the elements u of the space U of control 
parameters, is given. These Hamiltonian functions are defined by 

n 

(1.5.3) H(-,u) :N~ > R, (x,p) .— > u). 

We here consider the tangent vector f(x,u) as an element of R n+1 . 

Now any smooth function H : N — > K defines a Hamiltonian vector field Xh on 

N, whose coordinates are given by 



(1.5.4) 



(dH dH dH dH\ 
V9p '"'9p„' dxa'"' dx n ) 



The Hamiltonian vector field Xh can alternatively be described via the canon- 
ical symplectic form w on N. This is a closed 2-form which is defined at each 
point (x,p) e N by 

(1.5.5) u[X, Y) := X x Y p - X p Y x G R, 

for X = (X X ,X P ),Y = {Y X1 Y P ) e T {x , p) N ^ R n+1 x R n+1 . The vector field X H 
can then be defined to be the unique vector field on TV which satisfies 

(1.5.6) dH(Y)=uj(X H ,Y) 
for every vector field Y on N. 

The Hamiltonian vector field X u := X H ^.^ associated with the particular 
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Hamiltonian function (1.5.3) is in coordinates (x , ...,x n ,po, ■ ■■,p n ) of N given 
by 

d.5.7) X u = ^/ o(a . ltt ),...J B ( a , ti ) ) -^p J ?^^,... > -g ft ^^j . 

REMARK 1.5.2. The function / (•, u) = {<p(x, u), f u (x)) does by definition not de- 
pend on the variable x . Thus the term - Y^j=o Pj ( x ' u ) m equation (1.5.7) 
vanishes identically. So any integral curve t i-> (x (t), ...,x n (t),po(t), ...,p n (t)) 
of the vector field X u has constant coordinate p . The Hamiltonian function 
in a later formulation of Pontrjagin's maximum principle (cf. Theorem 2.4.2) 
will for that reason depend on the variables x , ...,x n and pi,...,p n only, while 
Po will appear as a parameter. As long as we are in the situation of a Euclidian 
state space N = R n xB", the relevant Hamiltonian function reads 

n 

(1.5.8) H(x x , ...,x n ,p 1: ...,p n ) =Po<fi{xi, -,x n ,u) + ^p l x l , 

where p is a constant. 



Now let t i-> u(t) be a control function and denote by t i-> 7(f) := 
the integral curve of the time-dependent Hamiltonian vector field X u ( t ) with 
initial value (xo,po). Equation (1.5.7) implies that the projection of 7 on the 
first factor of N = R n+1 x M. n+1 is equal to the trajectory t x(t) that arises 
from the control t i-> u(t) of the cost-extended system and has initial value 
x . The path t j(t) is for this reason called Hamiltonian lift of the path 
t i-> The existence of such a Hamiltonian lift gives rise to the idea of 

expressing the extremality condition (E) suitably as a condition on the time- 
dependent Hamiltonian function H{-,u{t)). The following definition introduces 
the correct extremality condition. 

DEFINITION 1.5.3. Let t ^ u(t) be a control function. The extremal Hamil- 
tonian M (x(t),p(t)) associated with the integral curve t (x(t),p(t)) of the 
time-dependent Hamiltonian vector field X u ^ is defined by 

(1.5.9) M (x(t),p(t)) := sup H (x{t),p(t),u) . 

ueu 

The statement of Pontrjagin's maximum principle (PMP) is the following. As- 
sume that the control 1 1-> u(t), t G [0, T] generates a trajectory t^x(t) e W l+1 
of the cost-extended system S ext which has the extremal property (E). Then the 
Hamiltonian lift oft i-» x(t) to the path t (x(t),p(t)) satisfies the extremality 
condition 



(1.5.10) 



H{x{t),p{t),u{t))=M{x{t),p{t)) 
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almost everywhere on [0, T]. 

We demonstrate how to make use of PMP in a concrete but typical situation 
(see also |1|, p. 191). 



Example 1.5.4. On M = R consider the system 
(1.5.11) xi=u, \u\<l 



or equivalently 
(1.5.12) 



x = (xi,x 2 ) £ K 2 , \u\ < 1, 
x\ = x 2 , 
^ x 2 = U. 

We wish to steer system (1.5.12) from x(0) = x to x(t F ) = such that t F 
is minimal. Thus we take tp = 1 to serve as a cost function. The family of 
admissible vector fields for our problem is 

f u (x) = (X 2 ,U). 

The Hamiltonian function H (•, u) as defined through equation (1.5.8) reads in 
this example 

H (x,p, u) = po + pix 2 + p 2 u, po constant, 
and leads to the Hamiltonian vector field 

( . _ dH(;u) 

) dP ' 

) ■ _ OH(-.u) 



(1.5.13) 



Xi = x 2 , x 2 = u, 
Pi =0, p 2 = -p\. 



The Hamiltonian system (1.5.13) that belongs to a time-optimal control t ^ 
u(t) has by PMP a solution such that 

H (x(t),p(t),u(t)) = max if (x(t),p(t),u) = max (pi(t)x 2 (t) + p 2 (t)u) . 
|u|<i M^i 

>From this it is immediate that u{t) = sgn (p 2 {t)), \ip 2 {t) ^ 0. Therefore, 

max if (x(t),p(t),u) =p 1 (t)x 2 {t) + \p 2 (t)\ . 

| u I < 1 

>From (1.5.13) it follows that p 2 (t) = a + (3t. One now solves the equations 
involving x in (1.5.13) with u(t) = sgn {p 2 {t)) to obtain the extremal trajectories 
of system (1.5.12). The result is that for any initial value x(0) = (xi(0),x 2 (0)) 
there is exactly one time-optimal trajectory t i— » x(t). This is obtained from 
choosing the control u to be u = +1 as long as 

1 2 
X X 

x\ > y> x 2 <0 or xi > — p x 2 > 
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is satisfied, and switching to u = —1 if 

x 2 

xx = — , x 2 < or 
or otherwise choosing u = — 1 as long as 

2 

x\ < x 2 < or 

holds, and then switching to u = +1. 

1.5.2. The general case. We now discuss how the previous considera- 
tions carry over to the case of an arbitrary smooth manifold M. The state 
space is now taken to be 

(1.5.14) N := T* (E x M) , 

the cotangent bundle of M x M. The manifold N carries in a canonical way 
a symplectic structure uj (i.e. a; is a non-degenerate closed 2-form), which is 
defined to be 

(1.5.15) w := d0, 
where the 1-form 9 is given by 

(1.5.16) 6 S (X) :=^(D^(X)) 

for t; E N and A" e T^iV. Here n denotes canonical projection from N onto its 
base manifold IxM, 

Using the symplectic form w one can repeat the construction of Hamiltonian 
vector fields, but now in a coordinate-free manner. For any smooth function 
H : N — > K define the Hamiltonian vector field X H associated with H to be the 
unique vector field on TV with the property that 

(1.5.17) dH s {Y)=LJz(X H ($,Y) 

holds for all £ € N and Y e T 5 7V. One can show (cf. (22 1) that in suitably de- 
fined local coordinates (so-called Darboux coordinates) the Hamiltonian vector 
field Xh is of the same form as defined in (1.5.4) for the Euclidian case. 
We again introduce a family H (•, u) : N — > R of Hamiltonian functions, para- 
metrized by the controls u e U, as 

(1-5.18) H(tu):=t(f u (n(0)). 

This definition can be shown to be consistent with that in (1.5.3), and one 
also can prove that the trajectories t i— » of the Hamiltonian vector field 
X u := Xh(., u ) are projected under n to those of the vector field / E onMx M. In 
complete analogy to the linear case we refer to the trajectories t \-> as the 
Hamiltonian lifts of the integral curves of /„. 

We finally adapt the extremality condition of the maximum principle to the 
new situation of a general state space N. 



xi = — — , x 2 > 0, 



x\ < — — , x 2 > 
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DEFINITION 1.5.5. Let t ^ u{t) be a control function. The extremal Hamilton- 
ian M (£(*)) associated with the integral curve t ^ of the time-dependent 
Hamiltonian vector field X u m is defined by 

(1.5.19) M(£(t)) := sup H(t(t),u). 

The statement of Pontrjagin's maximum principle on the relationship between 
the extremality (E) of trajectories of the control system £ and the extremality 
of their Hamiltonian lifts as formulated in 1.5.1 remains valid also in the non- 
linear case. 

For a proof and detailed discussion of PMP we refer the reader to the books 
1 1 1 and 1 15 1 and give here the precise statement of the maximum principle for 
time-optimal control problems. 

THEOREM 1.5.6. Let £ = (M, /„, U) be a control system and 1 1-> u{t), t e [0, T], 
a time-optimal control. For each u e U define the Hamilton function 

(1.5.20) #(-,u):T*M— >R, iT&u) =£(/„ (*■(£))) 

and denote by X u e T(T*M) the Hamiltonian vector field for H(-, u). Then any 
trajectory of q = /«((?) in M for the control function u possesses a Hamiltonian 
lift to a curve 1 1— > £(t) in T*M with the property that the extremality condition 

(1.5.21) H(at),u(t))=M(at)) 
holds almost everywhere on [0, T]. 

PROOF. QJ, Corollary 12.12. □ 

1.6. Kronecker Product Formalism 

In this section we develop a formalism which allows for an elegant descrip- 
tion of linear transformations on the tensor product V ® W of vector spaces V 
and W. This formalism is well-suited for calculations in quantum mechanical 
multi-particle systems. 

Notation 1.6.1. In the sequel, all vector spaces are finite-dimensional over 
the field IK = M or K = C. For short, we will always write V ® W for the 
K-tensor product of the vector spaces V and W. If V and W carry the inner 
product (•, -) v and respectively (-,-) w , then V ® W will also be regarded an 
inner product space with the induced inner product which is given by 

(1.6.1) (v ®w,v'<8> w') v ^ w = (v, v') v (w, w') w . 

We set End(y) for the vector space of K-linear endomorphisms of V. This is a 
K-Lie algebra with bracket [A, B] = AB - BA. 
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Definition 1.6.2. Let V, W vector spaces and A e End(V), B e End(W). We 
define the Kronecker product A® B e End(V <8> W) by 

(1.6.2) (A <g> B) (v <g> w) = Av <g> Bw 

for v G V, w G T4^. 
Set 

(1.6.3) X := Span K {A<g> B\ A e End(V), B G End(W)} C End(V ® W). 
LEMMA 1.6.3. Tne Kronecker product has the following properties. 

(i) For all A,B G End(V), C, L> G End(VK) and A G K, 

(A + _B) ® C = A ® C* + B <g> C, 

A<g>(C + L>) = A ® C* + A ® £>, 

(XA) ®B = \(A®B) = A<Z> (AS) , 

(A®C)o(B(Z>D) = (AoB)®(CoD). 

(ii) If {Ai}^ {Bj}. are bases of End(V), End(VK), then {A t ® Bj}^^ is a basis 
ofX. Moreover, X = End(V ® W). 

(Hi) Let (•, -} End (v) and (•, -) End (iy) &e inner products on End(V) and on End(iy). 
Then an inner product (•, •} on End(V <g> W) is defined by linear continuation of 

(1.6.4) (A ®B,A'® B') := (A, A') End(v) (S, S') Bnd(w) . 

IflAi}^ {Bj} - are orthonormal bases o/End(V) and End{W), £nen, with res- 
pect to the inner product as defined above, the set {Ai <g> Bj}^ ^ is an orthonor- 
mal basis of 'End(y (g) W). 

fiuj i*br aZZ A, A' e End(V) and B, B' G End(W) the following formula holds: 

(1.6.5) [A ®B,A'<S> B'] = [A, A'] ® BB' + A' A ® [B, B'] . 
fuj i^or a« A G End(V), B G End(W) 

(1.6.6) (A® = ® B H . 

(vi) With respect to ordered bases {v i } i ofV,{wj}. ofW, and {vi <g> Wj}^ ^ of 
V <g> W (nn£n £ne indices being ordered lexicographically), the endomor- 
phism A® B is represented by the matrix 

(1-6.7) ( r *fc s J')(ij),( fc ,i) 
A and .B are represented by matrices (r^)^ ^ and (sw)(fc iy respectively. 



(vii) For all A,B e End(V), 

(1.6.8) tr(A ® B) = tr(4)tr(B). 
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(viii) For all A,B e End(V), 

(1.6.9) A®B = P(B®A)P- 1 
with an involution P e Gl (V ® V). 

(ix) If AG Gl{V), B e Gl{W) then A®BeGl{V®W) and has inverse 

(1.6.10) (A&B)- 1 = A~ 1 ®B- 1 . 

(x) Let A, A' e End(V) and B, B' e End(V^) equivalent endomorphisms, i.e. 

A = UA'U- 1 and B = VB'V -1 for some U e Gl{V) and V e Gl{W). Then also 
A® B and A' <g> B' are equivalent with 

(1.6.11) A ® B = (U ® V) (A' ® B') (£/■ ® V)" 1 . 

(xi) For aZZ A,Be End(V), 

(1.6.12) det(A® B) = (detAdetB)", 
where n = dim V. 



PROOF, (i) This follows from the bilinearity of the tensor product, 
(ii) Let aiij e K such that J2i j a %j {^i ® Bj) = 0- So for all a,v e V, b,w e W, 
and Zj := (B^w, b) w it follows that 



= 



aij (A, <g> Bj) j (t) <g> to) , a <g> 6^ 



X! a )v (B 3 w, b) w 

= ^OL ij z j {A i v,a) v 



*,3 



^cxijZjAi v, 



1,3 



7 

' v 



Therefore, because (•, -) w is non-degenerate, 

^OijZjAi = 0. 

i,3 

Since {A i } i is linearly independent, we find that for each i 

aijZj = ^ a v (BjW, b) w = 0. 
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Because w and b are arbitrary, it follows that J2j a ijBj = 0> an ^ finally, by 
linear independence of {Bj}j, that ay = 0. Furthermore, the dimension of 

End(V) ® End(W) = Span K { A ® B\ A E End(V), B £ End(W)} 



is 

dimEnd(V r )dimEnd(VK) = (dim V) 2 (dim W) 2 

= (dim (V <E>W)f 
= dim End (y <g>W). 

So 

End(V) ® End(T^) = End (V <8> W) . 

(hi) The sesquilinearity of (•, •) follows from that of (-, -} End( y) and (-, •} End ( H /) 
together with the bilinearity of the tensor product. The bilinear form (•, •) is 
positiv definite because for all A e End(V), B E End(W) we have that 

(A ® B, A ® S) = (A, A) End(y) (B, S) End(w) > 0, 

and the last expression is equal to if and only if A = or B = 0, i.e. if and 
only if A® B = 0. 

(iv) The Lie bracket of two matrices (Xij) i3 and (Yij) i3 has matrix elements 
[X, Y] rs — J2 U XrvXus - Y ru X us . We apply this to the matrix representation of 
[A ® B, A' ® £?'] and find that 

[A ® B, A' ® fe; = ^ (A ® B) ij st (A' ® - (A' ® B%. rt ® £) at , fcZ 

St 

= E A is B jt A sk B tl - A is B 3t A sk B t i 

St 

= E (e - 
-E (e 4.^.* - 

= E ^ - E [ A ' . A ] lfc 

s t 1 

= [B, B%, E ^Xfc + [ A > A '} ik E B ' 3 tBu 

S t 

which proves the claim. 

(v) For all v, v' e V, w, w' e W we have that 

(v®w,(A H ®B H )(v'®w')) v ^ w = (v,A H v') v (w,B H w') w 

= (Av,v') v {Bw,w') w 
= ((A®B)(v®w),v' ®w') v ^ w , 
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hence (A ® B) H = A H ® B H . 

(vi) Let {vi}^ {wj} j be ON-bases for Vand W, respectively. Then {vi <g> iy J -} j • is 
an ON-basis for V <8> W and thus the matrix element (A <g> B) j • fei is given by 

(^®- B )«,fci = (■"i®w j ,(A®B)(v k ®wi)) V9W 
= (vi®Wj,Av k ®Bwi) v ^ w 
= {v i ,Av k ) v {w j ,Bwi) w 
= AfoBji, 

(vii) This follows directly from the matrix representation of A, B and A <g> B as 
given in (vi): 

= ^""^ AgBjj 

= (e^) (e^ 

= tr(A)tr(B). 

(viii) A comparison of matrix elements of A <g> B and B ® A shows that 

(A®B) ijM = {B®A) ji>lk . 

Thus a change of basis by a suitable transposition matrix P transforms A® B 
into B ® A. 

(ix) For all v e V, w e W, 

(A^ 1 ® B^ 1 ) (A ® B) (v ® w) = A- 1 Av®B~ 1 Bw 

= v (g> w 

= (A ® B) (A- 1 ® B- 1 ) (v ® w) . 

(x) This becomes clear from 

(U <g> V) (A' ® B') ([/ ® V)" 1 = (C/A'J/- 1 ) <g> (V-B'V -1 ) = A <g> B. 

(xi) The Kronecker product of a triangular matrix A e End(V) and an arbitrary 
matrix B e End(V) is the block triangular matrix A ® B, the diagonal blocks 
consisting of the (n x n)-matrices C y with 

(CyOfe, = (A (8)5)^., =A ij B kl . 
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Thus A® B has determinant 

det (A <g> B) = }~[ dot dj 

= U^t(A t3 B kl ) kl 

i,3 

= JJ^.det(B fcI ) fciI 

= (det(B w ) w ) n n^ 

= (dct (A l3 ) tj dct (B kl ) ki y , 

where in the last equation we have made use of the triangular form of the 
matrix (A^)^. Now any A e End(V) is conjugate to a triangular matrix A', so 
by (xi) we find that A <g> B is conjugate to the block triangular matrix A' <8 B. 
Thus 

det (A ® B) = det (A' <g> B) = (det A' det B) n = (det A dot B) n . 

□ 

EXAMPLE 1.6.4. Let V = W = C 2 , and let End(V) = gl 2 C be endowed with the 
inner product (A, B) = ^tr (A H B) . Then an orthonormal basis for g[ 2 C is given 
by 

and an orthonormal basis for the M-vector space su 2 by I x , I v , I z . According to 
the previous lemma, an orthonormal basis for the C-vector space End (R 2 ® R 2 ) 
is made up by the set 

{l ® l, l <8> 4, l ® 4, l ® l z , 4 ® l, 4 ® 4, 4 ® 4,4 <8 4, 

4 ® l, 4 is 4, 4 ® i y ®iz,iz<s> i, 4 <8> 4, 4 <8 4> ^ ® -k} • 

The special choice of the basis (1.6.13) is motivated from quantum mechanics, 
where the matrices 4, 4> an d 4 are called Pauli spin matrices. 
Another aspect will become important later on. Consider the IR-linear span of 
the set 

n 

(1.6.14) X:={JXj, 
where 

Xj := {fn (8 ... ® 4i <8 ...(8 4, (8...<8l|o!i £ {a;,y,z},i= l,...,j}, 

and 




1, j even, 
0, j odd. 
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So the set X, comprises (up to a sign 1 or i) the n-fold tensor products of ele- 
ments in {1, I x , I y , I z } with exactly j factors different from 1. By construction, 
ICsu (2"), because for each element 

Y = <g> ...®I ai <Z> ...®I aj ® ... ® 1 e Xj, 

the equation 

= Pi®...8/ ai ®...®j aj ®...®i+(p)i®...®(-/ ai )®...®(-/ aj )8...«i = o 

holds by Lemma 1.6.3 (v). Since the span of X has the maximal possible di- 
mension 4" - 1, it follows that X is a tensor product basis of su (2™). This basis 
will be used for further calculations in our discussion of concrete n-particle 
spin systems, cf. Chapter 3. 



CHAPTER 2 



General Control Theory for Spin Systems 



2.1. Quantum Mechanics of Spin Systems 



We here give a short overview of the basic principles of quantum mechanics, 
and in particular describe the physics of spin systems whose control properties 
are in the focus of this work. This exposition is by no means complete but is 
intended to introduce all the terminology and concepts needed in the subse- 
quent sections. We refer the reader to [25 1 for an exhaustive treatment of the 
subject. 

The premise of non-relativistic quantum mechanics is that the state of physical 
objects like electrons, protons and neutrons as well as larger systems of those 
like atoms and molecules is represented by a wave-function tp. This function tp 
carries all the information of the state of the system under consideration. 
The collection of the physical relevant wave-functions is given by the state 
space, a separable complex Hilbert space H. This space could e.g. be the space 
L 2 of square-integrable functions M 3 — > C; a wave-function tp e L 2 would then 
contain information of where the particle is localized in three-space. To be a 
little bit more concrete, 



gives the probability of "finding" the particle within a measurable subset Q of 

R 3 . 

It is convenient to normalize the wave-function ip to have norm 



wave-functions which only differ by a non-zero scalar will be regarded equiva- 
lent. 

The time-evolution of a state tp is governed by Schrddinger's equation 



with H a Hermitian operator, which is called the Hamilton operator of the 
system and which might also be time-dependent. It models the presence of a 
field acting on the states and causing their dynamics. 



(2.1.1) 



Jq ip(x)ip*(x) dx 



f R3 ip(x)ip*(x)dx 



(2.1.2) 




(2.1.3) 



ihip = Hip 
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As an example, the Hamilton operator for a single particle of mass m moving 
in a one-dimensional harmonic potential is given by 

(2.1.4) H = --^- + ax 2 , Q6 1+ 

2m 

where A = denotes the Laplace operator. 

One of the principles of quantum mechanics says that it is not possible to ob- 
serve the wave-function ip itself by performing an experiment and thus to gain 
complete information about the system. What can be observed is the spectrum 
of certain Hermitian operators called observables. These are for instance the 
operators x (space), — iW (momentum), H (energy), — ihx x V (angular momen- 
tum), and others like e.g. "spin". 

Let ipj, j e N, be a complete set of orthonormalized eigenvectors for the observ- 
able A with eigenvalues Xj and assume the state ip at some fixed time t to be 
given by 

(2.1.5) i> = J2i^i>j)^- 

3 = 1 

Then the measurement of A at time <o will give the result Xj with a certain 
probability, which simply is given by the squared modulus \(ip,ipj)\ 2 of the co- 
efficient of tpj in above Fourier expansion. Thus the expectation value of A in 
the state ip(t ) is expressed as 

oo 

(2.1.6) (A) = J2 1(^)1% = (V.ity) • 

i=i 

The process of measuring A will change the state tp to if) = tpj, if the result 
of the observation was Xj . It therefore is not possible to perform at the state 
tp the exact measurement of two or more non-commuting observables. This is 
the statement of Heisenberg's uncertainty relation, see |25| for a quantitative 
discussion. 

Define the time-evolution operator U to be the solution of the differential equation 

(2.1.7) inil = HU, 17(0) = 1. 

This differential equation is again called Schrbdinger equation. It is easily 
seen that the dynamics of the state ip under the influence of the Hamilton 
operator H are given by 

(2.1.8) i){t) = U(t)ip(0) for all t > 0. 

It is therefore sufficient to study U in order to obtain a full description of a 
given quantum mechanical system. 

The discussion so far applies in particular to the spin of a quantum mechani- 
cal system, a phenomenon which is without analogue in classical physics. The 
simpliest examples of quantum mechanical systems containing spin are the 
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fermions, or spin- 1 -particles, like e.g. electrons, neutrons and protons. To 
carry spin in that cases expresses the heuristic imagination that those parti- 
cles possess an angular momentum, which comes from a rotation around their 
own axis and which is sensible towards a magnetic field (and only for that rea- 
son is measurable). 

The mathematical formulation of this phenomenon is as follows. Choose H = 
C 2 to serve as the state space and let S = (S x ,S y , S z ) denote the so-called op- 
erator of total spin. Thus S is a 3-frame of (2 x 2)-Hermitian operators S x , S y , 
S z , whose components will be specified below. The spin projection in direction 

of e = (e x , e y , e z ) el 3 , ||e|| = 1, is given by the Hermitian matrix 

(2.1.9) S-e:= e a S a eC 2x2 . 

a—x,y,z 

A measurement of S ■ e in the state ip e H has outcome ±ifi. Let x ± be normal- 
ized eigenstates for S z = S ■ e z with eigenvalues ±\fo. Then in the basis {x ± } 
of K the matrix representation of the spin projection operators is as follows: 

(2.1.10) S x = \ 

The dynamics of the wave function tp e H under the influence of a magnetic 
field B = {B x , B y ,B z ) G M 3 are described by the Hamiltonian operator 

(2.1.11) ff magn = constB • S = const{B x S x + B y S y + B Z S Z ) G C 2x2 . 

The generalization to ensembles of n spin-i-particles is as follows. The total 
spin in this situation is given by the operator S = (S x ,S y , S z ), 

n 

(2.1.12) S a :-(g)5^ eC 2 " x2 ", ae{x,y,z}, 

i=l 

with Si_ a the spin projection of the i-th particle in direction of e a , and ® denot- 
ing the Kronecker product. The projection of S in direction of e = (e x , e y , e z ), 
\\e\\ = 1, is the operator 

n 

(2.1.13) S • e := (g) • e) e C 2 " x2 " 

i=l 

which is acting on the Hilbert space H = (C 2 ) ® n . Its eigenvalues are — ^ , — ^ + 

1, ^ — 1, The eigenspace £ m to the eigenvalue |m, m = —n, —n+2, n — 

2, n, has dimension 





(2.1.14) dim£„ 



2 



If the n-particle system is exposed to a magnetic field B = (B Xl B y ,B z ) which 
we assume to be equal to B t = {B lx ,B iy ,B iz ) at the locus of the i-th. particle, 
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then its spin will be described by the Hamilton operator 

(2.1.15) H = H d + iT magn e C 2 " x2 " , 
with 

n 

(2.1.16) H magn = const^B, • S h 

i=l 

the factor B t ■ Si as given by equation (2.1.11), and an operator H d , which is 
fixed and describes the coupling between the spins of the individual particles. 
This Hamilton operator H will typically be occuring in the discussion of the 
control properties of Schrodinger's equation 

(2.1.17) mil = hu, 

which is the content of the following sections. 

2.2. The Control Problem 

Given the Lie group G = SU (2™) and the following family of Hermitian opera- 
tors (C 2 ) " - (C 2 ) ": 

m 

(2.2.1) H(vi,...,v m ) = H d + ^VjHj, v = ( Vl ,...,v m ) E« m . 

3 = 1 

Fix an element Uf € G and consider the right-invariant control system on G 
given by 

(2.2.2) U = -iH(v)U, U(0) = 1 

with v acting as control variable. The question of interest to us is whether it is 
possible to steer system (2.2.2) from the initial state U (0) to the final state Uf- 
If this is the case, what will be the minimum amount of time to achieve this? 

The motivation for treating that kind of problem in time-optimal control arises 
from questions concerning the quantum mechanics of spin systems, such as 
ensembles of electrons or neutrons. Indeed, the operator H(v) acts as the 
Hamilton operator for systems of coupled spin particles that are under the 
influence of an exterior magnetic field of fixed direction and variable strength 
(modelled by the variable v). So (2.2.2) is just Schrodinger's equation for the 
time-evolution operator U of such a system (with H set equal to 1). 
The desire to solve a control problem as formulated above came alongside with 
the development of certain experiments in nuclear magnetic resonance spec- 
troscopy (NMR) and quantum computing. Here one needs to manipulate en- 
sembles of coupled nuclear spins and wishes to do so in least possible time. See 
e.g. |8| and 1 26 1 for details on this topic. 

In the discussion to follow we are going to generalize this kind of control prob- 
lem from the specific case G = SU(2 n ) to arbitrary compact Lie groups. 
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2.3. Equivalence Theorem 



Throughout this section G denotes a compact Lie group with Lie algebra g, 
while K denotes a closed subgroup of G with Lie algebra t. We are interested 
in the following affine right-invariant control system on G: 



with H d 6 arbitrary but fixed, and Hi,..., H m a fixed set of generators for the 
Lie algebra 6. 

2.3.1. Heuristic considerations. Consider again the evolution equation 
of the affine right-invariant control system (2.3.1), and let U(0) = U e G arbi- 
trary. Note that in principle the control variables vj,j = 1, m, may be chosen 
to be arbitrarily large in comparison to the norm of the fixed drift Hamiltonian 
H d . For such a choice of v — (vi, ...,v m ) the control system (2.3.1) will behave 
roughly as 



Note also that, if we choose v to be constant, the solution of ODE (2.3.2) will be 



>From our assumptions on Hj, j = 1, ...,m, and Theorem 1.4.5 it follows that 
it is possible to steer system (2.3.2) to any point U in the coset KU C G, and, 
by choosing \v\ large, to achieve this in negligible time. Thus from the point 
of view of time-optimal control, group elements contained in the same coset 
KU C G can be considered equivalent. The question of interest therefore is 
to find appropriate control strategies to steer system (2.3.1) from the identity 
coset K to any other coset KU in least possible time. To this aim it turns out 
to be useful to replace system (2.3.1) by another right-invariant system on the 
group G, whose reachable sets coincide (modulo K) with that of (2.3.1), but 
which has bounded controls so that the phenomenon of arbitrary fast move- 
ment within a coset does no longer occur. 

Which kind of right-invariant control system on G will be the appropriate one? 
To answer this question assume that (2.3.1) evolves from U e G under the 
influence of the control variable v, which we still assume to be constant. So we 
set 



(2.3.1) 




U(0) = 1 



(2.3.2) 




(2.3.3) 




m 



(2.3.4) 
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and consider on G the ODE 



(2.3.5) 



U=(H d + H )U, U(0) = U . 



The solution of this ODE is given by 



(2.3.6) 



[/(*) = exp (t(H d + H ))U . 



We separate the flow t >—> U(t) into two components Q(t) and P(t), i.e. U(t) = 
Q(t)P(t). As we have noticed before, it is possible to steer the original system 
(2.3.1) within a given coset Kg arbitrarily fast. For this reason, the factor Q(t), 
which merely describes motion within K, will be factored out. The allowed 
directions for steering the new system on G are therefore given by the values 
of P(t). These are now obtained from the ansatz U(t) = Q(t)P(t) with U(t) as 
in equation (2.3.6) and Q(t) = cx.ptH Q e K. A calculation yields 



P(t) = Q-^{t)U{t) + Q- l {t)U{t) 

= cxp (-fcffo) (-H ) U (t) + cxp (-tHo) (H d + H ) U (t) 

= exp(-tH )H d U(t) 

= exp {-tH ) H d cxp (tH ) cxp (-tH ) U (t) 

= (Ad exp( _ tHo ) J ff<i)Q- 1 (t)f/(t) 

= (Ad cxp( _ tffo) ff d ) P (t) . 



This gives rise to the idea of replacing the original system (2.3.1) by the follow- 
ing control system on G: 



the if-adjoint orbit of H d in q. 

In view of our initial considerations it would be desirable to interpret system 
(2.3.7) as a control system on the space of right-cosets modulo K. However, 
it turns out that the expression XP can only be given a precise meaning as a 
tangent vector of G/K, if G/K is taken to be the space of left-cosets, i.e. G/K = 
{gK\g e G}. Then just means right-translation of the vector X e g by 
P e G/K. On the other hand, the reachable sets R(l, t) for both system (2.3.1) 
and system (2.3.7) are easily shown to be Ad/f-invariant, so that the identity 
KR(1, t) = R(l, t)K holds for all t > 0. This makes it plausible that (2.3.7) can 
be used to define on the left-homogeneous space G/K a control system, which 
is equivalent to system (2.3.1) on the group G. 
This idea will be given evidence in the subsequent section. 



(2.3.7) 



P = XP, P(0) = 1, 



where the control X is taken from 



(2.3.8) 



Ad K H d = {kHdk- 1 ] keK} 
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2.3.2. Equivalence Theorem. 

DEFINITION 2.3.1. The control system (2.3.1) will from now on be refered to 
as the unreduced system. We furthermore define on G the adjoint system to be 

(2.3.9) U = XU, U(0) = 1 X e Ad K H d , 
and on G/K the reduced system to be 

(2.3.10) P = XP, P(0)=K, XeAd K H d , 

where the expression XP is explained as follows. 
If P = ir{g), then 

(2.3.11) XP := Di(tt o R g )(X) e T gK (G/K). 

This is well-defined: If we replace g by g' = gk, k e K, then we find that 

D ± (ir o R g ,)(X) = D^n o R gk )(X) 

= £>i(tt o R k o R g ){X) 
= D^oR g ){X) 
= XP, 

aS 7T O R k = 7T. 

Notation 2.3.2. We label reachable and approximately reachable sets etc. 
for the unreduced, adjoint and reduced systems by lower indices 1,2 and 3, 
respectively. 

Note 2.3.3. For convenience we add to the admissible vector fields of systems 
1 — 3 the zero field. This does not change the reachable sets Ri(x, t) and R,(a;), 
i = 1,2, 3, nor does it have any effect on the problem of finding time-optimal 
trajectories for those systems. This assumption merely has the advantage that 
in the remainder we need not distinguish between the sets R,(a;, t) and Ri(x, t) 
and also might use the fact that the sets Ri(x, t) are monotonely increasing in 
t. 

The remainder of this section is aimed to establish a theorem which will show 
that all three of the systems defined above can be considered equivalent. To be 
able to give a precise formulation of what "equivalence" should be, we introduce 
some terminology. 

Definition 2.3.4. Let S = (M,f u ,U) be a control system. Define the set 
S(x, t ) of approximately reachable points from x e M within time t > to be 

(2.3.12) S(x,t ) := f| ROM). 

t>t 

Here R(x, t) refers to the reachable set from x within time t as defined in 1.4.3. 
Thus a point y e M is contained in S(x, to) , if and only if for any neighbourhood 
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U of y and any e > there exists a point z £ U H R(x, t + e). Furthermore, we 

define the infimizing time to steer S from xi e M to x 2 € M to be 

(2.3.13) 

inf {t e M|x 2 e S(xi,t)}, if x 2 e S(xi,t) for some t e M, 
oo, otherwise. 



ii„f (xi,x 2 ) := 



Remark 2.3.5. As a consequence of the boundedness of the set Adx H d it easily 
follows that 



(2.3.14) Sj(l,t) = Rj(l,t) 

holds for j = 2,3 and for all t > 0, cf. Proposition 2.3.9. On the other hand, the 
distinction between the closure of reachable sets and approximately reachable 
sets in the case of system 1 became inevitable since here the set of controls is 
unbounded. 



The equivalence between the control systems of Definition 2.3.1 can now be 
stated as follows. 

THEOREM 2.3.6. (Equivalence theorem). For allt>0 the following holds: 



(i) Si(l,t) = KR 2 (l,t) = R 2 (l,t)K, 



(ii) ir = R${K,t), where ir denotes canonical projection G — > G/K. 

The proof of the equivalence theorem is based on the subsequent propositions. 

PROPOSITION 2.3.7. (i) The reachable sets for the adjoint system 2 are Ad K - 
invariant, i.e. the identities 

(2.3.15) Ad K {R2(l,t)) = R 2 (l,t) 
and 

(2.3.16) Ad K (i? 2 (l,t)) = #2 (M) 
hold for all k e K and t > 0. 

(ii) Any trajectory t i-> U(t), t e [0,t F ], of system 2 is mapped under n to a 
trajectory 1 1-> V(t) := (7r o [/) (t) of system 3. Conversely, any trajectory t \-> V(t) 
of system 3 ca/z be lifted to a trajectory t^U(t) of system 2. In particular, 

(2.3.17) n {R 2 {l,t)) = R 3 {l,t) 
holds for all t > 0. 



PROOF, (i) Let the control t X(t) of system 2 generate the trajectory 
t i > f/(t). Then the control t ^ AdkX(t), k e K, generates the trajectory 
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1 1 ► AdkU(t), because 

±Ad k U(t)\ t=to = ki/it^k- 1 

= kxitoWi^k- 1 

= (fcX(to)fc- 1 ) {kU(t )k- x ) 
= Ad k X(t ) -Ad k U(t ). 

This implies that the set #2(1, t) is Ada -invariant for any t > 0. Since the map 
Adfe is a homeomorphism, the same holds for R 2 (l,t). 

(ii) Let X : [0,t F ] — > AdKH d , t i-> be any control for the adjoint and for 
the reduced system. Denote by t ^ EM*) € G and by t C/ 3 (t) e G/K the 
resulting trajectories. Then 

j t (noU 2 (t))\ t=to = D Ua(to) * (u 2 (toj) 

= D Ua(to) n(X(to)U 2 (to)) 

= D U2{to) 7r(D 1 R U2{ta) (X(t ))) 

= D 1 (noR Ua(to) ){X(t )) 

= X(t Q ){7ToU 2 (to)). 

This shows that both U 3 and tt o U 2 satisfy ODE (2.3.10) on G/K together with 
the initial condition P(0) = K. Therefore U 3 = it oU 2 holds everywhere on 

[0,t F ]. □ 

Proposition 2.3.8. For all t F > 0, 

(2.3.18) R 1 (l,t F )CKR 2 (l,t F )- 

PROOF. Let U F e i?i(l, t F ) and t v(i) e K m a control for system 1 such 
that the corresponding trajectory t ^ U(t) satisfies U(t F ) = U F . Now let t 
Q(t) e K the solution curve of the ODE 

Q={^jOiH^jQ, Q(0) = 1, 

and 1 1 — ► P(t) e G the solution curve of the ODE 

P=(Q- 1 H d Q)P, P(0) = 1. 
On [0, t F ] consider the map i i-> V(i) := Q(t)P(t). It satisfies V(0) = 1 and 
V(t) = Q(t)P{t) + Q{t)P(t) 

= Vi(t)H?J Q(t)P(t) + Q(t) (Q-\t)H d Q(t)) P(t) 

= (jtvi{t)Hi + H?jQ(t)P(t), 



46 



2. GENERAL CONTROL THEORY FOR SPIN SYSTEMS 



which shows, by the uniqueness part of the Caratheodory theorem, that V 
coincides with U on [0,£f]- Now let system 2 evolve according to the control 
lawt^ Q^i^HdQit) fort e [0,t F ]. Then 

P{t F ) = Q^{t F )V{t F ) = Q-ifoWtp) = Q- l {t F )U F G KU F . 

This shows U F G KP(t F ) C KR 2 (l,t F ), as claimed. □ 



PROPOSITION 2.3.9. The set R 2 (x, t) is compact for all x G G and t > 0. More- 
over, the following holds: 



(2.3.19) Ri(x,t) = p) R 2 (x,t+ -\ 

n=l ^ ' 

PROOF. Let (-,-) be any right-invariant metric on G. The set R 2 (x,t) is 
bounded, because the set AdxH d C g of controls is bounded by a constant 
M (in the norm induced by the scalar product on S = TiG) . So, by 

the right-invariance of the metric (•, •), we have that ||/ u (z)|| = ||u|| < M for 
all u G Ad K Hd and z G G. Hence the distance d(x,y) between ir and any 
y G i?2(a;, t) can be estimated as follows: 

d(x,y) < [ \\u(s)\\ ds< [ M ds = Mt. 
Jo Jo 



Therefore, R 2 (x, t) is bounded, and R 2 (x, t) is compact. 



To prove equation (2.3.19), we first observe that R 2 (x, t) is contained in any 
of the sets R 2 [x,t + n e N, thanks to the convention made in 2.3.3. So 
R 2 (x, t) C Pl^Li Rz ( x > t + tt) • Now assume that there exists 

y G ^f) R 2 (x,t + ^yj \R 2 {x,t). 

Then y has distance d > from the compact set R 2 (x,t). >From this and the 
boundedness of the controls it follows that the infimizing time needed to steer 
system 2 from R 2 (x, t) to y is positive, i.e. 

inf | e > 0| y G R 2 (x,t + e)j > 0, 



in contradiction to y G H^Li ^2 (x, t + ^) . □ 
Proposition 2.3.10. For allt>0 the following holds: 



(2.3.20) ifc(l,t) CA72i(l,t). 



PROOF. Let x G i?2(l,i)- By definition there exists a control X : [0,t F ] — > 
AdxH d such that the resulting trajectory < i-> P(t) of system 2 satisfies P(t F ) = 
x . Theorem 1.2.2 allows us to identify the smooth manifold Ad K H d with the 
homogeneous space K/StabKH d . From this identification it becomes clear that 
the path t ^ X{t) g K/Stab K H d can be lifted to a path t ^ Q^it) G X with 
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Q(0) = 1 and the same regularity properties astns X(t). We therefore have 

(2.3.21) X{t) = Q-\t)H d Q{t) 

on [0, tp]. Notice that the path t h-» need not occur as a trajectory of the 
control system 

(2.3.22) Q = (^ViH^j Q, Q(0) = 1, v = fa, ...,« m ) e K m 

on Jf. But as it is pointed out in 1 16 1 and proved in 19 1, there exists a sequence 
of control functions t h-» v n (t) e R™, t e [0, ip], such that the resulting sequence 
(Q") n of trajectories for system (2.3.22) converges in Li against Q. Now define 

for t e [0,t F ] 

X n (t) := (Q")- 1 (t)H d Q n (t) e Ad K H d . 

Then, by equation (2.3.21) and the definition of Q n , the sequence {X n ) n con- 
verges in Li against X. Furthermore, let P n the solution of the ODE 

P n =X n P n , P"(0) = 1. 

The convergence of (X n ) n against X implies 

lim P n (t F )=P(t F ), 

n—>-oo 

cf. G|, p. 41-42. We finally set U n := Q n P n . Since the function Q n satisfies the 
ODE (2.3.22), it follows from the same calculation as in the proof of Proposition 
2.3.8 that U n solves the ODE 

U n = Lffd + ]TJ vfH^j U n , U n (0) = 1 
on [0, t F ] and therefore is a trajectory of system 1. So we have found that 

.T = P(t F ) 

= lim P n (tp) 

= lim (Q™)- 1 (tp) U n (t F ) 

= Q-^tp) lim U n (t F ) 



e KR^i.tp), 

as claimed. □ 

PROPOSITION 2.3.11. K C 5i(l,0). As a consequence, if z e Sx(l,t) for some 
t > 0, then zk e (1, t) holds for all k e K. 

Proof. By assumption, the elements Hi,...,H m e 6 generate t as a Lie 
algebra. This implies that the system 

(2.3.23) W = l^ViHA W, W(Q) = 1 



48 



2. GENERAL CONTROL THEORY FOR SPIN SYSTEMS 



is controllable as a system on K, cf. Theorem 1.4.5. Moreover, by the same the- 
orem, there exists a constant T > such that R(l) = R(l, T). Since the norm 
of the operator Y^Li v i^i may be chosen to be arbitrarily large, this equation 
holds for any constant T > 0. 

>From now on let T > and Wf G K be arbitrary but fixed, and choose a 
control t h-> (vi(t), ...,v m (t)), t e [0, tp], *f < T 1 , such that the resulting tra- 
jectory t h-» W(t) satisfies W(*f) = Wf- Then for all n e N the trajectory 
i i— > iy n (t) of (2.3.23), which results from the control t i— » ?; n (t) := nv(t), satis- 
fies W n (±t F ) = W F . Now consider the ODE 



U n = H d + n yMt)Hi U n , U n (0) 



i=i 



and let e > be arbitrary. Then for n sufficiently large it follows that 

1 



-*F 



< e, 



cf. (3|, p. 122. Thus 

W^F G i?l(l,tF) C Rx(l,T) 
holds for all T > 0. By definition, this implies W F G Si (1,0). Since Wf G K 
was chosen to be arbitrarily, it follows that K C Si(l, 0). 

The addendum that zfc € £i(l,t), if z G Si(l,t) and fc € if, is an immediate 
consequence of the right-invariance of system 1. Namely, if we can steer system 
1 into any neighbourhood Z e of z at time t + e and into any neighbourhood K s 
of k at time e, than the system can likewise be steered at time t + 2e into an 
arbitrary small neighbourhood Z e K e of zk. □ 



We now turn to the proof of the Equivalence Theorem 2.3.4. 



Proof, (i) From Proposition 2.3.7 (i) we obtain 

|J kR 2 (l,t) = |J fci? 2 (l,<)fc- 1 fc = |J i? 2 (l,i)fc, 

so that the identity 

KR^t) = R^T^) K 

holds for all f > 0. 

(A) Si(l,t) CKR 2 (l,t). 

Let x G 5*1 (1, t). By definition of Si(l,i) there exists a sequence (y„)„ in G 
with y„ e i?i (l,t+ ^) n tA(:r) and lim„_ 00 ?/„ = x. Here t/ E (:r) denotes the 
open ball around x of radius e. For all n G N there exists, by Proposition 2.3.8, 
z n e i?2 (l, < + ^) and k n e K such that y n — k n z n . Since all z n are contained in 
the compact set R 2 {l,t + I) and K also is a compact set, we find a subsequence 
of (z n , fc„) n (which we label again by n) and which satisfies 

lim (z n ,k n ) = (z ,k ) <E i? 2 (l,i + 1) x K. 
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By definition of z n we have that z e R 2 (l,t + e) for all e > 0. This implies 
together with Proposition 2.3.9 that 



It then follows that 



zo e fl R 2 =^(l,i). 

n=l ^ ' 



x = lim y n 

n — >oo 

= lim (k n z n ) 
= lim k n lim z n 

n^oo n^oo 



e KR 2 {l,t). 



(B) KR 2 (l,t) C ^(l.t). 

Let k e K, x e R 2 (l,t). Then by Proposition 2.3.10, x = fc'y for some k 1 e K 
and j/ e t). It follows that 



fcx e c KSx(l,t). 

Now by Proposition 2.3.11, K C ^ (1,0), hence A"5i(l,t) = Si(l,*). This shows 
kx e S*i(l, t) and implies KR 2 (1, t) C Si(l, i). Furthermore, if and also S*i(l, i) 
is closed, so that 

KR 2 (l,t) = KR 2 {l,t) C Si(l,t). 

(ii) From Proposition 2.3.7 (ii) and the fact that the images of compact sets 
under the continuous map tt are compact, it follows that 

R 3 (K,t F ) = n(R 2 (l,t F )) = tt (i? 2 (l,t F )) . 
Combining this with part (i) of the proof we find that 

Rl^KjPj = tt (iMV^) ) =7r(i? 2 (l,t F )if) = 7 r(S 1 (M F )). 
This concludes the proof of the equivalence theorem. □ 
Corollary 2.3.12. For a/Z ieG, 

(2.3.24) i inf;1 (1, x) = t infi3 (Jf, 7r(x)) . 

Proof. Statement (ii) of the equivalence theorem implies that for all p e 
G/K there exists y e 7r _1 (p) such that (K,p) — t inf>1 (l,y). Now for any 
x = yk e 7r _1 (p) we have by Proposition 2.3.11 that i^i (1, a;) = t inf;1 (1, y). So 
^inf,i (1, x) = iinf,3 (-Jf, i"(aO) holds for all x e G, as claimed. □ 

Corollary 2.3.13. Assume £/ie se£ 

(2.3.25) W := |J XAd K H d C 

Ae[o,i] 

to 6e convex. 77ierc £/ie equivalence theorem can be restated as follows: 
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(i) Si(l,t) = KR 2 (l,t) = R 2 (l,t)K, 

(ii) n(S 1 (l,t))=R 3 (K,t). 

Proof. Replacing the set Ada Hd of control parameters for system 2 by 
the set W will not change the reachable sets JRq(1, t) since the trajectories that 
can occur then are just reparametrisations of the trajectories one already had 
before for system 2. Also there will be no effect on the infimizing, respectively 
minimizing times because the tangents P(t) of the now occuring trajectories 
are of equal or smaller length than before, since A £ [0, 1]. The same holds for 
the reduced system 2. Because the set W is compact and due to our assumption 
convex, we may apply Filippov's Theorem (cf. (2, Theorem 10.3) to obtain the 
compactness of the sets R 2 (l,t) and Rs(K,t). Thus in the statement of the 
equivalence theorem, the expression R 2 (l,t) can be replaced by R 2 (l,t), and 
R 3 (K, t) can be replaced by R% (K, t) . □ 

Remark 2.3.14. We do not know if there is a criterion of how to decide in a 
concrete situation (where a subgroup K C G and a vector H d e g are given), 
whether the set W of the previous corollary is convex. This union of adjoint 
orbits turns out to be convex for instance in the example of Section 3.2. But 
one can also find low-dimensional examples, where W is not a convex set. 



2.4. The Maximum Principle for Compact Lie Groups 

In Section 1.5 the maximum principle of Pontrjagin (PMP) has been discussed 
as a tool for determining extremal trajectories in a given optimal control situ- 
ation. The application of PMP involves the optimization of functionals on the 
set U of controls, which take the form 

(2.4.1) u i — >H{x,u), 

where x is an arbitrary but fixed point of the phase space T*M, while 

(2.4.2) H(-,u):T*M — ► K 

is the Hamiltonian function associated with the optimal control problem. An 
immediate application of the maximum principle to our original (unreduced) 
control system 1 yields in general no further information on the optimality 
of a given control function, since in this case the space U of control param- 
eters is unbounded. So in general there need not be a control u e U which 
maximizes the functional (2.4.1). This is one of the main differences to the 
adjoint system 2. Here the space Ad K H d C jj of control parameters is compact, 
so that the above functional always attains its maximum. Thus the passage 
from system 1 to system 2 via the equivalence theorem makes the time-optimal 
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control problem of Section 2.2 accessible to an application of Pontrjagin's max- 
imum principle. We take this observation as an occasion to discuss PMP on Lie 
groups in some detail. 

Our discussion will be specialized to optimal control of right-invariant systems. 
By this we mean a control system £ = (G, f u ,U), where the set U of control 
parameters is contained in the Lie algebra g of G, and the admissible vector 
fields /„, u G U, are the right-invariant extensions of u, see Lemma 1.1.5. The 
results we are now going to discuss apply in particular to the adjoint system 
(2.3.9). The reason why right-invariant systems allow for a significant simpler 
formulation of Pontrjagin's maximum principle is due to the following facts. 

• The right-invariant vector fields /„, u £ U, can be considered to be 
contained in the finite dimensional Lie algebra g, not just as elements 
of the infinite-dimensional algebra T(TG) of arbitrary vector fields on 
G. 

• There is a canonical isomorphism between T*G and G x g*, which al- 
lows to describe Hamiltonian functions and Hamiltonian vector fields 
by globally defined coordinates. 

• Moreover, if G is compact, it is possible to make use of the existence 
of an ad-invariant inner product on g, which allows to further identify 
T*G with G x g. An advantage of this identification is that Hamil- 
tonian functions and Hamiltonian vector fields often become easier to 
describe in coordinates of g and G rather than in coordinates of T*G, 
respectively of G and g*. 

Our discussion follows the book 1 1 1 by Agrachev and Sachkov. Similar results 
can also be found in Mittenhuber's paper 1 23 1 . 

PROPOSITION 2.4.1. Let G be a Lie group with Lie algebra g. Denote by T*G 
the cotangent bundle of G. Then the map 

(2.4.3) $ : G x g* — » T*G, (g, A) !— » (D g R g -i)*(X) 

is a vector bundle isomorphism. Here (D g R g -i)* : g* — » (T g G)* denotes the dual 
of the linear map D g R g -i : T g G — > g. 

Proof. |23|, p. 187. □ 

For the remainder of this section we restrict our discussion to the case of a 
compact Lie group G. Such groups can be endowed with an ad-invariant Rie- 
mannian metric (•, •), cf. Example 1.2.4. This can now be used to first identify 
g* with g via the scalar product (;-) v Combining this identification with the 
map $ of Proposition 2.4.1 then yields an isomorphism between the vector bun- 
dles G x g and T*G. The ad-invariance of the metric is certainly not necessary 
for the existence of a vector bundle isomorphism T*G = G x g. Such can be 
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defined by using any Riemannian metric on G, and thus also exists for noncom- 
pact Lie groups G. The point is that the equations defining the Hamiltonian 
vector fields which occur in the general statement of the maximum principle, 
cf. equation (1.5.16), become particularly simple when choosing a trivialization 
via an ad-invariant metric. This point of view is substanziated in the following 
theorem. 

THEOREM 2.4.2. Let £ = (G, f u , U), U C q, be a right-invariant control system 
on a compact Lie group G. Moreover, let (pbe a cost function, which does not de- 
pend on the position variable g e G, i.e. <p : g — > K. Use the above identification 
T*G = G x Qto define the Hamiltonian function 

(2.4.4) H u (-,u):Gxg — ► M, (g, X) i— » (X, u) 1 + v<p(u), 

where u eU and v £ {— 1, 0}. Then the Hamiltonian vector field associated with 
H v (-, u) reads 

{■ _ dH"(-,u) 
9- ox 9, 
x - [ A ' ax ■ 

The Hamiltonian lift t i-v (g(t), X(t)) € G X q of any optimal trajectory t^g(t) 
for the optimal control problem (£, tp) which results from a control t i— > u(t), 
t G [0, T], satisfies the extremality condition 

(2.4.6) H" (g(t), X{t), u(t)) = sup H" (g(t),X(t), u) 

ueu 

almost everywhere on [0, T], 



Proof. 1 1 1, Theorem 12.10, and p. 264. □ 

Remark 2.4.3. See Remark 1.5.2 for the role of the parameter v e K in the 
Hamilton function H v {-, u) of the previous theorem. It can be argued that only 
the cases v = — 1 (so-called normal case) and v = (so-called abnormal case) 
need to be distinguished, as Hamilton functions with v > lead to trajectories 
that maximize the cost functional, while those with v < can be replaced by 
H v= ~ x after rescaling the cost function (p, cf. Q), p. 180. 

EXAMPLE 2.4.4. Let S = {G,f u ,U} be a right-invariant control system on 
the compact Lie group G, where U := {u e g (u,u) 1 = 1}. We are interested 
in time-optimal trajectories of S, and therefore set <p = 1. The Hamiltonian 
function (2.4.4) in this case reads 

(2.4.7) H"(;u) : G x g — ► R, (g, X) t—> (X, u)^ + u, 

and the corresponding Hamiltonian vector field is 



(2.4.8) 



9 = ug, 
X = [X, u] 
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The maximality condition of PMP implies that u{t) and X{t) have to be paral- 
lel for almost all t in order to maximize the term (X,u) 1 in (2.4.7). But then, 
according to the second equation in (2.4.8), X = 0, so that X is constant. Then 
also the control function u is constant. So the first equation of (2.4.8) reads 
g = Zg with some Z e U, which is independent of t. This equation can be 
integrated and yields the solution g(t) — exp(tZ)g . So the fastest way to steer 
system E is along the integral curves of right-invariant vector fields. 
Since G has been endowed with a bi-invariant metric, our time-optimal prob- 
lem can be, by the choice of U , considered a length-optimal problem. We thus 
have as a result that a lenght-minimizing curve necessarily is of the form 
t i ^ exp(tZ)g . Indeed, it can be shown that any geodesic of the Riemannian 
manifold (G, (-, •}) is of that form. 

Theorem 2.4.2 can be applied to the problem of finding time-optimal controls 
for the adjoint system 

(2.4.9) U = XU, U(0) = 1, X e Ad K H d . 

The associated Hamilton function and Hamilton vector fields to this control 
problem are as stated in Theorem 2.4.2, namely (with ip = 1 as cost function) 

(2.4.10) H(-,u):Gxg — ► R, (g,X)i — >(X,u) 1 +v 
and 



(2.4.11) 



I g = ug, 

\ X=[X,u}. 

The maximality condition of PMP for system (2.4.9) therefore reads 



(2.4.12) H max (g,X)= max (X,u) 1 . 

ueAd K H d 

We next derive a necessary condition for u e Ad K H d to satisfy (2.4.12). 

PROPOSITION 2.4.5. Fix leg. Then u = Ad ko H d g Ad K H d is a local maxi- 
mum of the function u (X, u)^ if the following holds. 

(i) ([X, u ] , Z} 1 = for all Z G t, 

(ii) ([Z, X] , [u , Z}} 1 <0 for all Z G I 
Condition (i) is necessary for (2.4.12) to hold. 

PROOF. If (2.4.12) holds, then for all Z G £ 

= ^ (Ad eX p tz uo,X) 1 | t=0 

= (adZ(u ),X) 1 

= ([Z,u ],X) 1 

= -{[X,u ],Z) x , 
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which gives the necessarity of condition (i). Now the function k f(k) := 
{Ad k H d , X) 1 has a local maximum in k a if condition (i) together with 



d 2 

—f(exptZ-k ) 



< for all Z e e 

t=o 



holds. But 



/(expiZ-fco) = (AdexptZ-fco^d'^Ol 

= (u ,X) 1 + t ([Z, u ] ,X) 1 + it 2 ([Z, [Z, uo}] ,X) 1 + (t 3 ) , 
so condition -C-/ (exp tZ ■ k ) < is equivalent to (ii). □ 

Our next goal is to derive a family of solutions of ODE (2.4.11). 

Theorem 2.4.6. Let H d e E 1 , i e Ad K H d , and C et Then for 
(2.4.13) u(t) := Ad exp( _ ct) A, 

a solution of ODE (2.4.11) is given by 

(2.4.14) 



I g(t) = cxp(-Ci) exp(Ci + At), 
\ X(t) = -u(t) + C. 

The corresponding Hamilton function is H(g,X,u) — (X,u) x . This Hamilton 
function also satisfies the maximality condition (i) of the previous proposition. 
Hence t i-> g(t) is an extremal trajectory of the time-optimal control problem 
associated with system (2.4.9). 



PROOF. We first notice that u(t) e Ad K H d holds for all t by the choice of 
C e t. A differentiation with respect to t now yields 

g(t) = -C*cxp(-Ct)cxp(Ci + At) +cxp(-CT)(C + A)cxp(Ct + At) 
= exp(-Ci) A cxp(Cf + At) 
= exp(-Ci) A cxp(Ci) exp(-Ci) cxp(Ct + At) 
= u(t)g(t) 

and 

X(t) = -u(t) 

= CAd cxp( _ ct) A - (Ad oxp( _ ct) A) C 
= [C,u(t)} 
= [X(t),u(t)}, 

as claimed. The fact that Hamilton system (2.4.11) arises from the Hamilton 
function H(-, u) is part of Theorem 2.4.2. It remains to show that for X = X(t), 
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u = u(t), and for all Z e £ the equation 

0=([X,u},Z) 1 

holds. Since 

— u = [X, u] 

and ii is tangent to AdxH d , i.e. u = [Y, AdkH d ] for some Yet and k e K, the 
last condition is equivalent to 

= ([Y,Ad k H d ],Z) 1 

for all Z e t. This is satisfied, because 

([Y,Ad k H d ],Z) 1 = (Ad k [Ad k -iY,H d ],Z) 1 
= {[Ad k -,Y,H d ],Ad k -,Z) x 
= -(H d ,[Ad k -iY,Ad k -iZ}) 1 
= 

holds as a consequence of the Ad-invariance of the inner product (•, ■} 1 and of 
our assumption H d e t 1 - ■ □ 

SUMMARY 2.4.7. >From Theorem 2.4.6 we obtain a whole family of extremal 
trajectories associated with the control system (2.4.9). These are parametrized 
by real numbers A and C, their role being the following. The parameter 
A = g(0) determines the direction of the trajectory t i-> g(t) at its starting 
point, while the parameters A and C jointly fix the direction at t = of the 
component x(t) of the Hamiltonian lift of g(t), as i(0) = — u(0) = [C, A]. Thus 
in a subsequent step one would have to determine those pairs (A, C) which ac- 
tually give rise to a time-optimal trajectory. 

For the special class of adjoint systems that we shall consider in the following 
section, it turns out that only the choice C = can lead to time-optimal trajec- 
tories. So from this example one can see that the set of extremal trajectories in 
the sense of PMP will in general be considerably larger than that of actually 
time-optimal trajectories. 

2.5. Time-Optimal Torus Theorem 

In the following an explicit solution to the control problem as described in Sec- 
tion 2.2 and reformulated in the Equivalence Theorem 2.3.6 will be discussed 
under the additional assumption that the homogeneous space G/K in that 
theorem gives rise to a symmetric Lie algebra pair (jj, t). 

Thus in the following we fix a compact, simply connected, semisimple Lie group 
G together with a closed subgroup K such that their Lie algebras g and 6 form 
a symmetric Lie algebra pair (g,t). Let g = t® p the corresponding Cartan-like 
decomposition, and take 1} C p to be a maximal abelian subalgebra of p. Denote 
by A the torus in G with Lie algebra f). 
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THEOREM 2.5.1. (Controllability). Let G be a compact Lie group with simple 
Lie algebra q. On G consider the affine right-invariant control system (2.3.1) of 
Section 2.3 with the Hamiltonian 

m 

(2.5.1) H(v 1 ,...,v m )=H d + J2 v i H r 

3 = 1 

Let t the Lie algebra generated by Hi, H m and assume (g, 6) to be a symmetric 
Lie algebra pair. Denote by 9 its Cartan involution and let \) be a maximal 
abelian subalgebra ofp that contains the projection H of Hd on p (such exists 
in view of Lemma 1.3.8). Assume furthermore that H is generic in the sense 
that it is not contained in any root hyperplane (of a root space decomposition of 
0c with respect to \)). 

Then system (2.3.1) has reachable set R(l) = G, i.e. it is controllable. The same 
holds for the respective reduced system on the symmetric space G/K. 

Proof. The result follows from Theorem 1.4.5 if we can show that the Lie 
algebra generated by t and H d is equal to g. Extend t) to a maximal abelian 
subalgebra t of g and let 

0C = tc © (J) 0a 

t«es 

be the root space decomposition of gc with respect to t. Choose as in (1.3.7) a 
subset E + C E of so-called positive roots. Since g is simple, it follows from 
Theorem 1.3.6 (hi) that the root spaces g a are all 1 -dimensional. We can there- 
fore write 

(2.5.2) flc = tc © CX a 

with X a e Q a \ {0} arbitrary. The root space decomposition (2.5.2) is related to 
the -1-eigenspace p of the Cartan-like decomposition g = fi © p in the following 

way: 

p = f) © n, 

where 

n:= pnc{x a -ex a ), 

Q6S+ 

cf. 1 10|, p. 335-336. Furthermore, the following sum is direct: 

tr\C{x a + ex a ) c t 

qGE+ 

So for any a e E+ we can choose c e C such that c (X a + 6X a ) € t \ {0}. We 
then find that 

[H ,c(X a + 6X a )} = ca(H )X a + 9[9H ,cX a ] 
= ca(H )X a -c6[H 0l X a ] 
= ca(H ) (X a - 9X a ) . 
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The commutator relation [p,t] C p together with our assumption a(H ) ^ for 
all roots a now implies that 

[H ,f\=n. 

Set i := (t, n, -ffd) Lie = (6, n, H ) Uc . It remains to show that f) C i. Assume 
X e Let Z e i arbitrary and write 

Z = Z 1 + Z 2 + Z 3 with Zi et,z 2 e n, z 3 e f). 

Repeating the calculation before yields [X, Zi] e n. Furthermore, [(), f)] =0 and 
[f),n] C [p,p] C so that 

[X, Z] - [X, Zi] + [X,Z 2 ]en + iC i. 

This shows that i is an ideal of the simple Lie algebra g, which is not possible 
unless i = g. 

The statement on the reduced system follows immediately from Theorem 2.3.6 
(ii). □ 

REMARK 2.5.2. A different proof of Theorem 2.5.1, without the assumption 
that H a be not contained in any root hyperplane, can be found in |5|. 



We now turn to a discussion of time-optimal control. To fix ideas we initially 
consider the simple example of a single-particle system. Here the underlying 
Lie group is G = SU(2), the drift operator can be chosen to be 



H d = 

and the free Hamiltonian to be 

Hx = 



i 

-i 

1 

-1 



G 0U(2), 



e 5u(2). 



Denote by 6 the 1-dimensional Lie algebra spanned by H\. It generates the 
compact Lie subgroup 



K 



cos t sin t 
— sin t cos t 



of G. We first observe that any element Uf = 
be decomposed as 

(2.5.3) U F = U x k 

. , / cost sint \ 

with fc = e K and U x = 

\ — sini cost J 

calculation shows that we can choose 
Reb 



t 6 Rj 

a b 
—b a 



aa + bb — 1, of G can 



r s 
s r 



symmetric. Namely, a 



(2.5.4) 



tan(t) 



Rea' 



r = a cos t + b sin t, s = —a sin t + b cos t. 
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Now Uf and U\, being contained in the same coset of K, are for all £ > 
either both contained in the approximately reachable set Si(l,t) or both not 
contained in Si(l,t), see Proposition 2.3.11. Therefore, in the time-optimal 
control problem for the unreduced system, Uf can be replaced by Ui. Time- 
optimal trajectories from 1 to U\ for the adjoint system 



may now be obtained as follows. The symmetric matrix U\ admits a decompo- 
sition as 



where k e SU{2) is some orthogonal matrix, i.e. kk T = 1, and t a e M. A calcu- 
lation shows that one can choose k e K. Furthermore, we choose f such that 
|t 1 is as small as possible. Since the set of controls for the adjoint system is 
invariant under conjugation by elements of K, U\ can be replaced by the new 
terminal point U 2 = cxpt Q H d . The key observation is then that a time-optimal 
trajectory between 1 and U 2 is given by geodesies t ^ exptH d of a maximal 
torus A of G that contains H d . This trajectory is generated by a constant con- 
trol function u(t) = H d or u(t) = -H d , the sign depending on that of to. 
The main argument in the proof of this observation will be that a suitable 
projection of an arbitrary trajectory 1 1-» U(t) between 1 and U 2 into the maxi- 
mal torus A leads to another trajectory t ^ V(t) for the adjoint system, which 
reaches U 2 in the same time as U(t). 

This projection process reduces the original (adjoint) control system to a sys- 
tem on the torus A, which can be solved easily because the admissible vector 
fields are now pairwise commuting. In our particular example, those admis- 
sible vector fields are the right-invariant extensions of H d and —H d . In the 
higher-dimensional cases, this set of vector fields will become the Weyl orbit 
W ■ H d of H d as defined in Section 1.3.1. Optimal trajectories will then again 
be generated by choosing piecewise constant controls within the set W ■ H d , 
and therefore will be geodesic arcs on a maximal torus A of G. 
We next consider the general case of a compact semisimple Lie group G and a 
symmetric Lie algebra pair (g, 6). Our observations made so far motivate the 
following theorem. 



THEOREM 2.5.3. (Time-optimal torus theorem). Let G be a compact, simply 
connected, semisimple Lie group, and K C G a closed subgroup. Assume that 
their respective Lie algebras form a symmetric pair (g, t), its Cartan-like decom- 
position being q = 6 © p. Let t) be a maximal abelian subalgebra of p, H d e 1} 
a generic point in the sense of the previous theorem, and A C G the maximal 
torus with Lie algebra f). Furthermore, denote by Y\,...,Yi the elements of the 



(2.5.5) 



U = XU, X G Ad K H d 



(2.5.6) 



f/i =k(expt H d )k- 1 



2.5. TIME-OPTIMAL TORUS THEOREM 



59 



Weyl orbit W ■ H d . Set 

(2.5.7) Q:=Ad K & = {kak- 1 \aen,keK}, 

and let Uf € © arbitrary. Here SI C A is a certain domain in A, which will be 
specified in Lemma 2.5.7. 

Then the minimal time t min (Uf) for steering the adjoint system 

(2.5.8) U = XU, 17(0) = 1, XeAd K H d 

to Uf is equal to a* (Uf), which we define to be the smallest non-negative value 
of a such that the equation 

(2.5.9) U F = kexp fc" 1 

can be solved with k e K, fa e R, and J2\=i Pi = 1- Moreover, a time-optimal 

trajectory to Uf is given by 

(2.5.10) 

!exp (ifcYifc- 1 ) , te[0,a/3i], 
; \ 
exp (tkYik^ 1 ) ]li=i cx P (aPikYik^ 1 ) , te a^jft,a 

The proof of this theorem needs some preparation. 

To start with, we introduce the root space decomposition 

(2.5.11) 0c-0o©00 Q 

of 0c with respect to f), cf. Theorem 1.3.2. Note that this is not quite the usual 
root-space decomposition, since t) was only assumed to be maximal abelian in 
p, but not in g. We will nevertheless call the spaces Q , a e £, root-spaces, 
and the linear forms a : f) — > M roots. The subset of roots which do not vanish 
identically on p is denoted S p . We define the Weyl group in the same manner 
as before to be the quotient 

(2.5.12) W := JV(f))/Stab(fj). 

In contrast to the previous definition, the isomorphism type of W now depends 
on the pair (g, t), not on g alone. 

Having fixed our notation in this way, we make now the following definition. 

Definition 2.5.4. The following subset off) is called the diagram D associated 
with the symmetric Lie algebra pair (0,6): 

(2.5.13) D:={X a(X) e mZ for some a e S p } . 

The diagram D is the union of finitely many families of equidistant hypersur- 
faces of t). The connected components of t) \ D are called cells. Those are pair- 
wise isometric, open polytopes in t). The set D contains the root hyperplanes 
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a(X) = for some a e £}, so that the arrangement of Weyl chambers 
in f) is now further subdivided into the system f) \ D of cells. 
The set of reflexions on the hyperplanes that constitute the diagram D gene- 
rates a group Wag, the so-called affine Weyl group. This group contains W as a 
subgroup. More precisely, W^e is isomorphic to the semidirect product W x T 
of W with the group T of translations in f) that map D onto itself. 
The affine Weyl group acts transitively on the set of cells. Furthermore, if A is 
a cell such that e A, then any orbit W & s ■ X, X e \), intersects A in exactly 
one point. 

For further details on the affine Weyl group and the diagram, see [101 and |6|. 
The relevance of the affine Weyl group for the proof of the time-optimal torus 
theorem becomes clear through the following decomposition lemma. 

LEMMA 2.5.5. (if AftT-decomposition). Let G be a compact, simply connected, 
semisimple Lie group. Then each g G G yields a decomposition 

(2.5.14) g = k ia k 2 , 

with ki,k 2 G K and a — exp X G A. The factor a = exp X in this decomposition 
is unique up to an action of the affine Weyl group. So if 

(2.5.15) k x ak 2 = k[a'k' 2 

are two decompositions of the above type with a = exp X and a' — exp X' , then 
there exists w G Waff such that X' = w ■ X. The factor a becomes determined 
uniquely, if in addition the requirement X G A is imposed. Here A denotes as 
before a cell whose closure contains 0. 



REMARK 2.5.6. Our initial assumption on G to be simply connected is only 
needed for the proof of the last lemma. We do not know a version of the KAK- 
decomposition lemma without this assumption. 

LEMMA 2.5.7. Let A e f) a cell as in Lemma 2.5.5, and set Q := exp A C A. 
Define the map 



Proof. [10|, p. 321-323. 



□ 



(2.5.16) 



$ : K x A x K 



k\ak 2 , 



and let 



(2.5.17) 



7r 2 : K x A x K 



A 



be the projection map onto A. Then the following holds: 
(i) For each g G G, the set 



(2.5.18) 




consists of a single element ttaG?)- The map 



(2.5.19) 



tta=:G 



n, a 
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is continuous. Moreover, 

(2.5.20) ir A \ n = id. 

(ii) Set G rog := tt^ 1 (SI). Then the restriction ofiiA to the set G reg yields a dif- 
ferentiable map. Its differential is for all g = 3>(fci , a, fe) € G rcg and for all 
leg given by the following formula: 

(2.5.21) D 1 (tta o Rg) (X) = DiiZ^o,) (r (k^Xh)) , 

where T : q — ► f) denotes orthogonal projection. In particular, as a consequence 
of Kostant's Convexity Theorem 1.3.9, 

(2.5.22) £>! (tt.4 o Rg) (Ad K X) = D x R„ A{g) (c (W ■ X)) 
holds for all X el). 



Proof, (i) Using Lemma 2.5.5 on the if A/f-decomposition, we can de- 
scribe the set ("7r 2 o (g) as 

fao*- 1 )(g) = exp{W B g-X), 

where X e A is the unique element such that 

g = k\ exp Xk2 

holds for some fci, k% e K. We now claim that 

(7r 2 or 1 )( 5 )nO = {expI}. 

So suppose that cxpX' e {n 2 ° <I ) ~ 1 ) (ff)nO. Hence X' = w-X for some w e W a ff, 
and cxpX' = cxp Z for some Z e A. Thus cxp Z = cxp (w • X). It follows that 

for some V e cxp -1 (1). 

Now V can be regarded as an element of the subgroup T of translations of W a s , 
and we can further write 

Z = w' ■ X 
for some w' e W a s. This implies that 

z g w-xn A = {x}, 

and finally cxpX' = cxpZ = cxp Jf, as claimed. This shows in particular that 
the map tta is well-defined. 

For the proof of continuity we define the quotient space A/^ w by 

a ~w aff a' a' = w\a exp^ioj -1 for some (w\,d) e W&g, 

and notice that Cl and A/^ w can be identified as topological spaces. Indeed, 
for any I € | there exists a unique to e W a ff, which we may write as w = 
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(wi,rf)eWx T, such that 

w ■ X = wi(X + d)wi 1 G A 

holds. Hence exp w ■ X = w\ (expXexpd) , and cxpX ~ expw • X G fj. Now 
let a, a' G fi with a ~ a '. Then a' = uiiaexp(d)wf 1 for some (wi, d) G W X T, 
and both sides of the last equation define a AAA'-decomposition for a'. By 
Lemma 2.5.5 it follows that a' = a. This shows that Cl is a complete set of 
representatives for the equivalence relation ~w aff • So the projection map 

pr : — > A/^ WaS , a i — > [a] 

is bijective. It is also continuous, because the action of Wag on A is continuous. 
Now that A/^ w is compact, we see that the map pr is in fact a homeomor- 
phism. Next consider an arbitrary open subset U of Q. Denote the projection 
of A onto A/^ w ff by n^. Then the preimage of U under tta can be described as 

n^(U) = (tt 2 o (U) = $ {K^Z\U),K) . 

As a consequence of the identification of Cl with A/„ w it follows that the set 
7r^ 1 ({7) is open in A. This implies that $ (A', 7r^ 1 ({7), A) is open in G (cf. (g) 
where such a statement is proved for general actions of compact Lie groups on 
manifolds), and finally proves the continuity of the map tta- 
For any a £ A one has the A'AA-decompositions a = 1 • a • 1 and a = kiTTA(a)k 2 
with suitable factors fci, fc 2 G A. Now the middle factor of a AAA-decomposition 
is uniquely determined if it is in addition required to lie in ti. So for a G O, the 
elements a and 7r^(a) have to coincide. This proves the identity (2.5.20). 
(ii) We first consider the action 7 of A x A on G given by 

{k\,ki) ■ g = hgkz 1 - 

A calculation shows that the orbit 7 (a) of a G A intersects the torus A per- 
pendicularly. Namely, for all Xi, X2 G f and Z e t) the following holds: 



(cl ( ex P( <Xl ) - a - ex P(^2))| t=0 ,^ (exp(tZ) -a)| t=0 y 



= {Xia + aX 2 , Za) a 

= {X\a, Za) a + (aX2d~ 1 a 1 Za) a 

= {Xi,Z) 1 + (aX 2 a-\Z) 1 

= {Xi,Z) 1 + (X2 1 a- 1 Za) 1 

= 0. 
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Here we used the bi-invariance of the metric together with the fact that the 
sum g = 6 p is orthogonal. >From this there follows the direct sum decompo- 
sition 



(2.5.23) T a G = T a A © T a C 7 (a), 

if we can show that the dimension of 1 (a) is complementary to that of A in 
G. This is in fact the case, provided that a is contained in fl, as will be proven 
now. 

To this aim we make use of the following result, cf. 1 10 1, p. 294-295. Denote by 
M the centralizer of fj in K, i.e. 

M :={keK\ kXk- 1 =xyx et)}. 

Then the map 

tp : K/M xt) — > G/K, ([k] , X) i — > [exp (Ad k X)\ 

is surjective. It is moreover regular on K/M x(t)\D), i.e. in particular regular 

on K/M x A. 

Now let U C K/M x A be a sufficiently small open subset such that the map 

<p\ v :U^xp{U) 

is a diffeomorphism, and the set W := 7r _1 (y) C G admits a trivialization 

tfj :W — ► V x K 

with 

V> (exp (Ad k X) k') = ([exp (Ad fe X)] , fc'" 1 ) 
for all ([k] , X) e U and k' e K. Then, by construction, the map 

a:UxK — > G, ([k] , X, k') i — ► ^ {cp ([k] , X) , k') 

is regular on U x K. Furthermore, the map 7 is related to a as follows: 

(2.5.24) ~f(k,k'k,expX) =a([k],X,k') . 

The regularity of a implies now that for all X e pr 2 (J7) C A the differential 
of the map a (•, X, •) at the point ([fc] , fc') e pr 1 (C/) x K C K x K has maximal 
rank r = dim G — dim A = dim G — dim A. From equation (2.5.24) it follows that 
the same holds for the map 7(-, •, a), where a = cxpX e exp (pr 2 (J7)) C Q. This 
finally proves equation (2.5.23). 

We now proceed in the proof of formula (2.5.21). By part (i) of the proof we 
have for all a e fl, Z e f) and t sufficiently small the identity 

ir a ((exp tZ) a) — (expiZ)a, 
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and therefore 

(2.5.25) D a ir A (Za) = D ± R a {Z). 
On the other hand, for all tX e T a O^(a) it follows that 

(2.5.26) ir A ( (exp tX)a)= tt a (a) 

since tta is constant on the orbits of 7. Equations (2.5.23), (2.5.25), and (2.5.26) 
now imply that 

(2.5.27) D a ir A (D 1 R a (X)) = D x R a (T(X)) 
holds for all X e g. 

Now let g € G lcg arbitrary. Then g = k\ak 2 for a = tta(9) € and some 
fei, fc 2 £ AT. Then for all leg, 

7Ta ((exp tX) g) — tta ((expiJf) fciafo) 

= TTy^ (fci (exp tfc^Xfci) ofe) 
= ((exptfej -1 ^^;!) a) . 

Differentiating both sides of this equation with respect to t at t = and apply- 
ing equation (2.5.27) yields 

(2.5.28) D g TT A {D 1 R g {X)) = D a n A (D ± R a (k^Xh)) 

= D 1 R a (T(k^ 1 Xki)), 

as claimed. 

The identity (2.5.22) is now immediate from Kostant's Convexity Theorem 
1.3.9, which for fixed k\ € K and let implies that 

r (fcf 1 (Ad K X) fci) = r {Ad K X) = c(W-X) 

holds. Combining this with identity (2.5.21) we obtain (2.5.22). 
We finally note that the map tta is differenciable in g e G lcg , since equation 
(2.5.28) implies that the partial derivatives of tta in all directions D 1 R g (X) of 
T g G exist, with continuous dependency on g. □ 

We now turn to the proof of Theorem 2.5.3. 

Proof. First consider the special case Up E For such an element Up 
there always exists an a which satisfies equation (2.5.9) with k = 1. This is 
evident in the case where g is a simple Lie algebra, since then W acts irre- 
ducibly on f) (cf. [12|, p. 53), so that the Weyl orbit W ■ H d of any H d spans 
f). In the semisimple case f) decomposes as a direct sum fj = 0^ of abelian 
subalgebras, and W still acts irreducibly on each summand f)j . By our choice of 
Hd, the component of Hd in fyj is non-zero, such that as in the simple case the 
set W ■ H d = {Yi,.. ., Yi} spans 

Moreover, for any tuple (k, a, that satisfies (2.5.9) it is clearly possible 
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to steer system (2.5.8) to Up at time t F = a along the trajectory as specified in 
(2.5.10). Let a*(U F ) be the smallest non-negative number such that equation 
(2.5.9) can be solved with a = a*(U F ). 

It remains to show that any trajectory of system (2.5.8) with U (t F ) = U F nec- 
essarily satisfies t F > a*(U F ). In order to prove this we use the projection 
map ir a of Lemma 2.5.6 to replace 1 i-> U(t) by another trajectory of the system 
which joins ?7(0) = 1 to U(t F ) = U F , but is completely contained in A. Since the 
tangents U(t) of U(t) are by definition right translates of the set Ad K H d C g 
we can apply Lemma 2.5.6 (ii) to conclude that the path 

t .— » 7i a (U(t)) =: V(t) 

has tangents V(t) G D x Rv(t) ( c {W ' H d )). By part (i) of the same lemma, 

V(t F )=7T A (Up) = U F . 

However, t ^ V(t) is in general not a trajectory of the control system under 
consideration. But for all t€ [0, t F ], its tangents are of the form 



V(t) ^D-lR 



V(t) 



with X)i=i = 1 an d ii as stated. Therefore, since [ii,3^] = for all i, j 
1, ...,/, 



So steering the adjoint system according to the control 

f te [o,7i], 



E/ — 1 v~^v / 

i=i7J-i)2^i=i7i 



where 



:= / F ft(r)dT, 
Jo 



results in a trajectory £ W(t) on ^4 which satisfies 



7z 



W(t F ) = V(t F ) = U F . 



This is again immediate from the pairwise commutativity of the vector fields 
Yi, 2 = 1, Moreover, it follows from the definition of a*{U F ) that a*{U F ) < 
t F , as we have claimed. Finally, a trajectory from U(0) to U (a* (U F )) = U F , and 
therefore (by construction) a time-optimal trajectory, is given by t h-> W(t), and 
this has the form as stated. 

Now let U F G 6 be arbitrary. By definition of 6 we can choose k e K such 
that U F := kUpk^ 1 G 0. The trajectories joining the system from 17(0) = 1 to 
Up, respectively to U F at fixed time t F > are in bijective correspondence to 
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each other, see the proof of Proposition 2.3.7 (i). Namely, if 1 1 — > X(t), t e [0, t F ], 
is a control which leads to a trajectory t i-> [/(£) with U(t F ) = U F , then t 
fcX(t)fc _1 , f g [0, *f], is a control (with values again in Adx Hd) which leads to 
a trajectory with endpoint Up, and vice versa. 

Hence a* (Uf) = a* (Up^j , which is, by the first part of the proof, the smallest 
non-negative value of a such that 

U F = fccxp ^aJ2^ Y ^j 

can be solved for some k e K. At the same time, this a is the smallest one 
possible such that 

Up = k~ l U F k = fc _1 fcexp ^aY^PiYij k^k 

can be solved with k e K, which means that the value of a* (Uf) is as stated. 
The statement on the time-optimality of the trajectory t U(t) with U(tp) = 
Uf follows from that on time-optimal trajectories in the special case Uf e Cl by 
using again the correspondence of trajectories with if -conjugated endpoints as 
formulated before. □ 

COROLLARY 2.5.8. Assume the Lie groups G and K to satisfy the prerequisities 
of Theorem 2.5.3, and let Pf € G/K arbitrary. Then the set 

(2.5.29) X F := 7t _1 (Pf) n 6 C G 

is non-empty. Furthermore, the canonical projection of any trajectory of type 
(2.5.10) with endpoint in X F yields a time-optimal trajectory between n(l) = K 
and Pf for the reduced system 3. 

PROOF. Let Pf = gK for some g e G. Then g yields a K Aif -decomposition 
of the form g = fciafcf 1 fc 2 with k\, ki e K and a e 0. Hence fciafcf 1 e 7t _1 (Pf) H 
6 = X F . 

Now let t i ^ u(t) e Ad K H d , t e [0, tp], be a control for system 3, such that the 
corresponding trajectory t i-> P(t) has terminal point = Pf- Let system 

2 evolve according to the same control function u, and denote the resulting 
trajectory by t U(t). From the proof of Proposition 2.3.7 (ii) it follows that 
P(t) = it o U(t) holds on [0, t F ]. In particular, Up := U (t F ) satisfies ir(U F ) = Pp. 
In complete analogy to the proof of the previous Theorem 2.5.3 we can apply 
the projection map tta together with a conjugation to the trajectory U in order 
to obtain a trajectory t W(t), t e [0,t F ], which is of the form (2.5.10) and 
satisfies W(t F ) = Upk' for some k' e K. Namely, if 

Up = kiTA^k^k'- 1 

is a K Aif -decomposition of U F , then we will set V(t) :— kit a(U '(t))^ 1 for t G 
[0, t F ]. As in the proof of Theorem 2.5.3 we then obtain from the path t V(t) 
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a trajectory t i-» of system 2 which is of the special form (2.5.10) and has 
endpoint W(t F ) = V(t F ) = U F k'. 

We denote the control function which generates this trajectory by v. Let system 
3 evolve according to the same control function v, and denote the corresponding 
trajectory by t *-> Pi(t). It follows as before that 

P l (t) = iroW(t) 

holds on [0, t F ]. In particular, 

Pi(t F ) = ir{U F k') = n(U F ) = P F . 

This shows that any trajectory t i— > P(t) which reaches P F at time t F can be 
replaced by a trajectory t h-> i'i(i) which also satisfies Pi(t F ) = P F , and has the 
special form as stated. 

It remains to show that any trajectory of that type is indeed time-optimal for 
system 3. So let W F e X F be arbitrary, and denote by t W'(t), t e [0, t' F ], a 
trajectory of type (2.5.10) which reaches W' F at time t' F . The statement of the 
time-optimal torus theorem is that such a trajectory is time-optimal for system 
2. Since n(W F ) = tt(W f ) = P F , we see that W F = W' F k' holds for some k' e K. 
Thus n A (W' F ) = n A (W F ) =: a e Cl. Furthermore, as W F ,W' F e 6, it follows 
that both W F and W' F are conjugated within K to a. Hence W F = ^'Wpk" -1 
for some k" G X. Finally, as a consequence of Proposition 2.3.7 (i), it follows 
that t F = t' F , as otherwise t i-> W(t) and i ^ W'(t) could not both be time- 
optimal trajectories for system 2. 

This shows that the projection of t h-> under 7r again yields a time-optimal 

trajectory for system 3 between K and Pp. □ 

We finally describe how the combination of the Equivalence Theorem 2.3.6 with 
the previous Corollary 2.5.8 can be used to solve the time-optimal control prob- 
lem for the unreduced system (2.3.1) we have originally been interested in. We 
therefore keep all the assumptions made in the time-optimal torus theorem, 
and let Up e G arbitrary. From Corollary 2.3.12 it follows that t F := t in{A (U F ) 
equals t in i 3 (U F K). Corollary 2.5.8 can now be used as follows to construct a 
trajectory t U(t) for system 1, which satisfies the time-optimality condition 
U(t F ) - Up. 

(1) Decompose Up as Up = kiak^ 1 k 2 with k±,k 2 G K and a e £l. This is 
just a K AfT-decomposition as introduced in Lemma 2.5.5. By defini- 
tion of Cl, the factor a in that decomposition is uniquely determined. 

(2) Set V F := fciafcf 1 G 6, where 6 C G is as defined in Theorem 2.5.3. 
Let t i — ► V(t), t G [0,£f], be a trajectory for system 2 of type (2.5.10) 
such that V(t F ) = V F holds. By Corollary 2.5.8, such a trajectory 
exists and is time-optimal. 
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(3) By construction, the trajectory t ^ V(t) consists of a finite number of 
geodesic arcs of the form 

1 1 — ► k' cxp (tY j+1 ) k'- l V{tj), t e [tj,t j+1 ] , 

where k! e K,Yj e W ■ H d , and [tj,tj + \] is a subinterval of [0,£f] as 
specified in Theorem 2.5.3. Write Yj as k"Hdk!'~ x for some k" G K. 

(4) System 1 can be steered from V(tj) to 

V(t j+1 )=k'exp(t j+1 Y j+1 )k'- 1 V(t j ) 

within time tj+i - tj by first producing the element {k'k"y l V{t ) 
within zero infimizing time. Evolution under the influence of the drift 
operator H d for time tj+i - tj transfers the system in a second step 
from {k'k")- 1 V(tj) to cxp (t j+1 H d ) {k'k")' 1 V(tj). The point V(t j+1 ) is 
finally reached from exp (t j+1 H d ) (k'k")~ L V(tj) within zero infimizing 
time. 

(5) The iteration of such so-called pulse-drift-pulse sequences transfers 
the unreduced system within infimizing time tp from V(0) — 1 to Vp. 
The point Up = Vpk 2 differs from Up by only an element of K and 
thus can also be reached within infimizing time tp. 



CHAPTER 3 



Discussion of some Explicit Spin Systems 

3.1. General Considerations 

In this chapter we shall apply the results obtained so far to a number of con- 
crete examples. While the Equivalence Theorem 2.3.6 and the Time-Optimal 
Torus Theorem 2.5.3 have been formulated for right-invariant control sys- 
tems on arbitrary compact Lie groups (with additional assumptions such as 
semisimplicity to be satisfied in 2.5.3), we are now focussing on the particular 
case G n = SU (2") and the time-optimal control of certain n-particle spin sys- 
tems. As a typical example of that kind of control-system we want to discuss 
system (2.2.2) of Section 2.2, where the subgroup K n generated by the con- 
trol Hamiltonians was K n = SU(2)® n . The equivalence theorem allows us to 
replace this control system by the corresponding reduced system 

(3.1.1) P = XP, P(0) = K, XeAd Kn H d 

on the homogeneous space G n /K n . Let g n := su(2") and t n := su(2)®™ be the 
Lie algebras of G n , respectively of K n . 

To proceed in our discussion of time-optimal control, we first of all determine 
those examples in the family (3.1.1) of control systems which meet the require- 
ments of Theorem 2.5.3, respectively its Corollary 2.5.8. The following theorem 
shows that these are satisfied for n = 1 and n = 2 only. 

THEOREM 3.1.1. The pair t n ) is a symmetric Lie algebra pair if and only if 

n < 2. 

PROOF. In course of proving this theorem we need to work in the tensor- 
product basis of g n = su (2™) as introduced in Example 1.6.4. This basis com- 
prises the n-fold products of elements {1, I x , I y , I z } with at least one factor 
being different from 1. 

Assume (g„, t„) = (su (2™) , su(2) <8> "), n e N, to be a symmetric Lie algebra pair 
with Cartan-like decomposition Q n = 6„ © p„. We claim that the set 

X := |J X 3 

with X 3 as defined in Example 1.6.4 is a basis of p„. Because £„ is spanned 
by X\, the R-linear span of X is seen to be complementary to 6„ in g„. It 
remains to show that X is orthogonal to t„ (cf. Lemma 1.2.6, and use that q„ 

69 



70 3. DISCUSSION OF SOME EXPLICIT SPIN SYSTEMS 

is semisimple). Therefore, let 

A := f 3 l <8 ... <g> I ai ® ... 8> 7 Qj . 8> ... ® 1 G Xj, 

j > 2, and 

B := 1 <8 ... <8 Ip ® ... ® 1 G Xi. 

Let fc G {1, j} be an index such that the position of I ak in A differs from that 
of Ip in B. Choose 7, 6 G {2, y, z} such that [7 7 , = 7 Q3 and set 

C := 1 <g> ... ® Zy <g> ... <S> 1 G Xi, 

I 1 at the same position as I aj , and 

D := i Ej l® ... <g) I ai ® ... ® J^.^ ® ... ®7 { ®...®le X jt 

Is at the same position as I aj and all other positions coinciding with those of 
A. Using Lemma 1.6.3 (iv) we find that A = [C, D] and [B, C] = 0. From the 
ad-invariance of the Killing-form k it now follows that 

k {A, B) = k ([C, D],B) = k (D, [B, C]) = k (D, 0) = 0. 

This shows A G X^. So 

(X) C (X 1 ) ± = ^=p n , 

and finally, for dimensional reasons, {X} = p n . 
Now let n > 3 and consider the elements 

A-l := I x ® I y ® I z ® 1 <8 ... <8 1 G p n 

and 

A 2 := i/j, <8 Jj, <8 1 <8 ... <8 1 G p„. 
We use Lemma 1.6.3 (iv) to calculate 

-i[A u A 2 } = [I x ®I y ®I Z ® 1 <8 ... <8 (gi/y <8> 1 <8 ... <8> 1] 
= [4,4] 8> (4 <8 4 <8 1 <8 ... 8 1) (/ y 8 1 8 ... 81 1) 

+44 ® [4 ® 4 ® 1 <8 ... <8 1,4 <8 1 <8 ... <8 1] 
= I z ®Iy ®I Z ® 1<8... <8 1 

-J z 8 (8 I y ] <8 (4 <8 1 <8 ... (8 1) (1 <8 ... 8 1)) 

-4 8 (Iy ® [I z <8 1 <8 ... (8 1, 1 <8 ... (8 1]) 
= -7 2 <8 1 <8 4 <8 1 <8 ... ® 1 

-I z <8 (0 <8 (4 <8 1 <8 ... <8 1) (1 <8 ... <8 1) + Iy ® 0) 
= -I z ® 1 <g> I z ® 1 <8 ... <8 1 

So [p„, p„] is not contained in 6„, and (g„, t n ) cannot be a symmetric Lie algebra 
pair. 
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In the trivial case n = 1 the assertion clearly holds. 

Now let n = 2. We have to check that q 2 admits an involutive Lie algebra 
automorphism 9 with i 2 as its 1-eigenspace. On the basis X\ UX 2 as introduced 
before define 

e(x) = i x > ifx ' x - 
{ -X, iflel 2 , 

and extend 9 to a linear map on q 2 . So 9 is by definition an involution. A calcu- 
lation now shows that the commutator [Yi , Y" 2 ] of any two elements Yi,Y 2 £ X 2 
is contained in i 2 , while for all Z e t 2 the commutator \Y\,Z] is in the linear 
span of X 2 . So 

e[Y 1 ,Y 2 } = [Y u Y 2 ] = [e(Y 1 ),e(Y 2 )} 

and 

fl[r 1 ,z] = -[y 1 ,z] = [fl(y 1 ),fl(z)] ) 

i.e. 9 is also a Lie algebra automorphism. □ 

Summary 3.1.2. Our discussion so far lead to the result that the problem 
of time-optimal control of an n-particle spin system with Hamiltonian H = 
H d + J2T=i i v jxl]x + v ]yljy +v ]Z Ij Z ) G su(2") can be solved by applying the 
results of Section 2.5 if and only if n e {1,2}. This will be carried out in detail 
in the subsequent two sections. 

Although the number of cases where Theorem 2.5.3 on time-optimal control 
applies is quite limited as long as we are only interested in Hamiltonians of 
the special form above, one nevertheless could imagine other interesting right- 
invariant control systems on G — SU(n) or any other compact semisimple Lie 
group G that allow for the application of that theorem. 



3.2. Single Particle Systems 

We turn to a discussion of control system (3.1.1) in the case n = 1. Here we 
assume the exterior magnetic field to excite rapidly the a>component I x of the 
spin I = (I x ,I y ,I z ) and consider I z to be the drift Hamiltonian. This leads to 
the unreduced control system 

(3.2.1) U=(I Z + vl x ) U, U(0) = 1, veR 

on G = SU(2). I x generates the Lie subgroup 



(3.2.2) K 



cos t — sin t 
sin t cos t 



t e 



which is isomorphic to U(l). So the corresponding reduced system is 

(3.2.3) P = PX, P(0) = K, XekA K I z 

on the two-dimensional homogeneous space G/K = SU(2)/U(l). This space is 
diffeomorphic to the projective plane MP 2 , as will become clear later. The set 
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K&kI z of control variables is a circle around zero, cf. Example 1.3.10. 

The pair (g, 6) = (su(2), u(l)) is symmetric with Cartan involution 6 defined by 

(3.2.4) 6 (al x + 0I y + yl z ) = al x - pl y - 7 I, 

Hence Theorems 2.5.1 and 2.5.3 apply to system (3.2.3). So this system is in 
particular controllable. 

The orthogonal complement of { = M.I X with respect to the Killing form on su(2) 
is 

(3.2.5) p = RI y + RI Z 

which leads to the Cartan-like decomposition g = t © p. One can identify p 
with the tangent space T K (G/K) of G/K and then argue that the geodesies 
emanating from K e G/K are of the form 

(3.2.6) 1 1 — ► (exp tX) K, X e p, 

cf. 1101 . p. 212. Since G/K is compact, the Hopf-Rinow theorem implies that 
any point gK e G/K is of the form gK = (cxpX) K for some X e p. Because 
exp X is a symmetric matrix if X is symmetric, we see that the points of G/K 
can be represented by the symmetric unitary (2 x 2)-matrices. These comprise 
the set 



f ( ia \p \ 






{ \ i/3 -ia J 





(3.2.7) Sym(2) 



<p, i> e [o, 2tt] 



/ cos ^e 1,p i sin tp 

which is a 2-dimensional submanifold of G. Moreover, the map 
(3.2.8) V : Sym(2) — > S 2 C R 3 . 



(cos ip cos ip, sin ip cos tp, sin tp) 



I cosij}e lv isin^ \ 

I isin?/; cosV'e -1 ^ J 
is a diffeomorphism between Sym(2) and the 2-sphere § 2 . A calculation now 
shows that two elements gi , g 2 G Sym(2) are in the same coset modulo K if and 
only if .gi = g 2 or g x = -g 2 . Hence 

(3.2.9) G/K = Sym(2)/^_ s = S 2 / g ^- g = MP 2 . 

In order to solve the time-optimal control problem related to the unreduced 
system (3.2.1) we shall procede as outlined at the end of Section 2.5. To this 
aim we fix the maximal abelian subalgebra \) := RI Z of p and determine the 
Weyl orbit W ■ H d of H d , the action of W & g on f), and the sets fl 9 C G. 
The action of the affine Weyl group W a s on f) is generated by reflexions 

(3.2.10) X i — > -X 
and translations 

77-77 

(3.2.11) X^X + —I Z , neZ, 
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as is seen from the root space decomposition of g with respect to f), cf. Example 
1.3.7. The cells in f) are then the sets 

(3.2.12) L n := («»« + !)}. n G z - 

We fix the cell A := L and set 



(3.2.13) 



f2 := exp A 











(»■!) 



C A. 



Here A denotes as usual the maximal torus of G with Lie algebra t). The set 
6 = Ad^^ of Theorem 2.5.3 is in this situation the following: 



(3.2.14) 



9 



7T 7T 

2' 2 



cos-0e lv isin^ 
isin-0 cos?/>e~ lv 

Note that Sym(2) = 6U-6 so 9 can be thought of as a hemissphere in Sym(2) = 
§ 2 . 

Given a terminal point Uf € G, a time-optimal trajectory between C/(0) = 1 
and [7f is now obtained in the following manner. 

(1) Decompose Uf as Uf — U\ki with U\ e and fci e if. This can be 
accomplished by making use for instance of (2.5.4). 

(2) The general form of a matrix U\ C 6 is 

cos ^e 1 ^ i sin ^ 
isin^ cosV'e -11 ^ 

Calculate the parameters y and ^ of the matrix {/i determined in step 
(1) and set 

a := cxp (al z ) £ fi, 

with 

7T~ 



7T 7T 

2' 2 



0, 



a := arg ^cos ip cos + i^/l — cos 2 i/> cos 2 ipj 6 

By construction, U\ = kak^ 1 holds for some k e K. 

(3) Calculate kJ( such that Ui = fcAifc" 1 . 

(4) A time-optimal control sequence to generate Uf is 

1 — ► k~ x k\ — ► ak _1 k\ — ► kak~ x k\ = Uf- 

Here the first and the last arrow mean synthesizing k~ x k\ and k by 
so-called hard pulses (the infimizing time for accomplishing this being 
equal to zero), while the middle arrow denotes evolution of the system 
under the influence of the drift Hamiltonian 7, for time a. 



3.3. Two-Particle Systems 



In this section we discuss control system (3.1.1) for the special case of n = 2 
spin-particles, and assume that the x- and the y-component of each of the spins 
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may be excited individually. The problem of controlling the spin of such a 
system then reads 



(3.3.1) 

with 
(3.3.2) 



U=\H d + J2 VjHj ] U, U(0) = 1, Vj e 

3 = 1 



H 1= I 



Ix ) 



Hn=I 



Ho = L 



2x ■ 



H A = l 



2y 



The drift operator H d needs to be chosen within a maximal abelian subalgebra 
t) C su(4) (which will be specified later) subject to the restriction that it is not 
contained in any root hyperplane. The elements Hj, j = 1, ...,4, generate a 
subalgebra t isomorphic to su(2) ® su(2), and we already know by Theorem 
3.1.1 that the pair (su(4),su(2) <8>su(2)) is a symmetric Lie algebra pair. This 
allows us to argue along the lines of the previous section. 

We denote by K = SU{2) ® SU(2) the connected subgroup of G = SU{A) with 
Lie algebra t. So the resulting reduced system is 



(3.3.3) 



P = PX, P(0) = K, X G h&K H d , 



on the 9-dimensional homogeneous space G/K. In order to describe this space 
more succintly, we first of all observe that the Lie algebras t = su(2) ® su(2) C 
su(4) and so (4) = { X e su(4)| X + X T = 0} are isomorphic. An isomorphism 
ip : t —y so (4) is given by conjugation with the unitary matrix 



/ 1 



(3.3.4) 



U --=2 





-1 




1 \ 



— i 




This is for instance seen by using Lemma 1.6.3 (vi) in order to represent the 
elements of a basis oft by (4 x 4) -matrices, such as 



1 = 



o 
o 

-i 





i 








-1 

i 






1 







\ 



\ 

1 



/ 



/„ = 



\ 



-i 

-ij 



i 





-1 





i 









-1 

\ 


i 

1 oy 



\ 




h = 



o \ 







-i 

i 

-i J 
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and then by checking that the map ip sends this basis to a basis of so (4). 
The Lie algebra isomorphism <p can be integrated to a Lie group isomorphism 
$ : K — > 50(4) =: K which likewise is given by conjugation with U. This also 
shows that the homogeneous spaces SU{A)/ (SU(2) ® Sf7(2)) and SU{A)/S0{4) 
are diffeomorphic as S'C/(4)-homogeneous spaces. Namely, an equivariant dif- 
feomorphism is given by the map 

(3.3.5) i/):SU(4)/(SU(2)®SU(2)) — > SU(4)/SO(4), gK i — >UgU~ 1 K. 
The map ip is well-defined: 

il){gkK) = UkgU~ l k 

= UgU~ 1 UkU- 1 K 

= UgU~ x K 

= ip(gK) 

holds for all k e K. 



We remark that the space SU{4)/ '50(4) appears as a symmetric space of type 

A I in Example 1.2.7. It also can be shown to be diffeomorphic to the Grass- 

mannian manifold of 3-dimensional subspaces in IR 6 , cf. [28 1, p. 322. 

An argument analogous to that in the previous section shows that the elements 

of G/K = Sf7(4)/50(4) can be represented (again not uniquely) by those of the 

space Sym(4) of symmetric unitary (4 x 4) -matrices. In this case a calculation 

yields 

(3.3.6) G/K £ Sym(4)/^, 
where 

(3.3.7) gi - 02 .9i = gid, de D, 
with 

(3.3.8) D := {±1, ±diag(l, 1, -1, -1), ±diag(l, -1, 1, -1), ±diag(l, -1, -1, 1)} . 

We now turn to a discussion of the time-optimal control problem as formulated 
in Section 2.3 in the here relevant case of (g, t) = (su(4),so(4)) constituting a 
symmetric Lie algebra pair. The Cartan-like decomposition of the Lie algebra 
is 

(3.3.9) = e®sym(4), 

where sym(4) = |leg| X T = X}. Indeed, p := sym(4) is the orthogonal com- 
plement of 6, because it is complementary as a vector space, and for all X e p, 
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Y e t we compute (making use of equation (1.1.16)) that 

k(X,Y) 



= 8tr (XY) 




4 4 

= 8^^2 X jk Y kj 




j=i k=i 




= X jkYkj - & 


i ^ X jk Ykj 


j<k 


j>k 


- a / ; X jk Y kj - c 


' 2 , A jk^k] 


j<fe 


3>k 


= 8 X k jY jk - S 


1 ^ X jk Y k: j 


j<k 


3>k 


= 8 y~] Xj k Y k:j - g 


'< XjkYk-j 




3>k 


- o, 





as X fei = X jfe and Y~ fe j = -Y} fc . 

The next step is to determine a root space decomposition of g. Choose f) := 

Rffi + Rff 2 + Ki7 3 C p with 

(3.3.10) Hi :=diag(i,-i,0,0), H 2 := diag (i, 0, -i, 0) , # 3 := diag (i, 0, 0, -i) 

to serve as a maximal abelian subalgebra of g. Let A e G the maximal torus 
with Lie algebra t). The root space decomposition of gc — sUC with respect to f) 
is then given by 

(3.3.11) g: h: :-00,> 

i¥=3 

Qij as in Example 1.3.7. The coroots have also been determined before; they 
are the following 

(3.3.12) Yi = H lt Y 2 = H 2 , Y 3 = H 3 , Y 4 = H 2 -H u Y 5 = H 3 -H u Y 6 = H 3 -H 2 , 
together with 

(3.3.13) Y j+6 := -Yj,j = 1,...,6. 

It is easily checked that reflexion in f) on the hyperplane perpendicular to Y e f) 
is given by 

(3.3.14) X — >X- 2 ^' Y ) y. 

\\Y\\ 2 

>From this we obtain the Weyl orbit W ■ H d of the element H d = X)»=i e § 
by reflexion on the root hyperplanes Y j L , j = 1, 6, which is, in coordinates 
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with respect to the ordered basis (Y 1 ,Y 2 ,Y 3 ) of f), the set 

W ■ H d = {(ai,a 2 ,a 3 ) , (ai,a 3 ,a 2 ) , {a 2l a ll a 3 ) , (a 2 ,a 3 ,»i) , (a 3 ,ai,a 2 ) , 
(a 3 , a 2 , ai) , (-ai - a 2 - a 3 , o 2 , a 3 ) , (a 3 , a 2 , -ai - a 2 - a 3 ) , 
(ai, -ai -a 2 - a 3 ,a 3 ) , (a 3 ,ai - a 2 - a 3 ,ai) , 
(ai,a 2 , -ai -a 2 -a 3 ),..., (01 - a 2 -a 3 ,a 2 ,ai)}. 

In the same manner as in 3.2 we determine the cell A C \) to be the convex hull 
of 0, | .Hi, f i? 2 , and f if 3 , and set Q := exp A C A It is easily checked that any 
a e A permits a decomposition a = aid with ai e and d e D, where the group 
D is as denned in (3.3.8). 

By proceding as described at the end of Section 2.5 we obtain a time-optimal 
trajectory between the identity and any given point Uf € G in the following 
way. 

(1) Perform a polar decomposition of Uf to obtain k\ e K and ?7i e Sym(4) 
with Uf = U\k\. If U\ £ 6 then replace ?7i suitably by f/i<i, and fci by 
d _1 fci, where d e D. 

(2) Diagonalize f/i as ?7i = kak' 1 with k e K and ae!i. Write a in the 
form 

24 

a = J^J cxp (a(3jZj) 

3 = 1 

with X) fcli A — 1» an d ^24 the elements of the Weyl orbit W-H d . 
Choose the parameter a > to be the smallest one possible. This a 
then satisfies the minimality condition of Theorem 2.5.3. 

(3) Steer system (3.3.1) as depicted below: 

1 — ► fc- 1 ^ 

— ► exp (a[3 2 4 Z 2i ) fc _1 fci 

— ► exp(a/3 23 Z 23 )exp(a/3 24 y^24)fc" 1 fci 



24 



J^exp(a/3 j Z 7 ) 



v^ 1 

24 \ 

J|exp(a/3 j Z 7 ) fc- 1 /!!! = t/ F . 
y=i / 

Here again the first and the last arrow means producing movement 
within the subgroup K of G and is realized by performing a so-called 
hard pulse, while the middle arrows symbolize evolution of the system 
in direction of Z\,..., Z 2i for times a/?i,..., a/3 2 4, respectively. 
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